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INCENTIVE MAGNITU 


PERFORMANCE 


BENJAMIN H 


Universit 


Interest in the effects of vari 


itions 


in incentive magnitude on learning 
and performance in animals has in 
7 
creased markedly during the last 
decade, and there are now ore t! 
75 experimental papers deciing with 
various aspects of this top: These 
papers have been ct ncerned ior the 
most part, with the following prob 
lems 
1. What is the effect of ent ‘ agni r 
1 rate of learning and a [ pe 
ance? 
2. Does variation in incentive ig le 
affect learning, performance both? 
3. What is the effect of entuy ig 
on resistance to experimental ext 
4. Does incentive igni n tera “ 
any other parameters of reinforce , 
if so, in what way? 
5. What is the nature the r 
or dimension of reinforcement, whose quant 
itive Vv iriation prod ices behav ' ha ges 
6. Are there any differences in performance 
ter s of whether ag ‘ exypx es 
e incentive magnituce L give 
) e tha ca Wing i ) 
The terms incentive, reward, a re 
cement are used interchangeal in th 
paper, to refer to stimuli which strengther 
responses they follow, without implication of 
subscription to any particular theoretical 
position. Such phrases as “variation in in 
centive magnitude,” “quantitative varia- 


tior and the like, are also used interch inge 
ibly Wisconsin General Test 
viated WGTA; periodic re 
ited PR 

* The author would like to express, his ar 
to A. M. Schrier for his crit 


reading of a prelimin th 


Apparatus is 
abbre I ? 


abbrevi 


rcement 1 


preciation 


ary version of per, 


ye) 
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What is the effect of incentive magnitude 


acquired reward value? 


This paper will follow an outline 
suggested by these questions, and in 
attempt will be made to 

conclusions as definite as the 
ivailable evidence allows. 

A brief review of some general 
methodological considerations would 


eacn case an 


araw 


seem to be in order to help provide a 
framework for the discussions which 
will follow. A variety of types of ex- 
perimental operation have been used 
by different in the 

iipulation of incentive magnitude. 
hese operations have tended to fall 


investigators 


into one of five classes, usually in- 
volving the manipulation of food in 
some form, but sometimes water, or 


receptive female of the same spe- 
CLics. 

These classes of operations are: 
ariation in weight, volume, or 
incentive unit; (0) 
variation in number of equal-weight 
incentive units; (c) variation in dura- 
tion of exposure to the incentive; (d) 


a \ 


size ol a single 


variation in concentration of sugar, 
isually sucrose, solutions; and (e) 
variation in incentives which do not 
ilter the effects of deprivation, such 
is concentration of saccharine, or ex- 
tent of incomplete sexual behavior. 


It should be apparent that the 
ial dimensions, or mechanisms, 
of quantitative variation are several, 
and that they may differ among 
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these classes of operations. Guttman 
(1953) has pointed out that the ex- 
pression, ‘quantity of reinforcing 
agent,’’ may refer to several dimen- 
sions of variation. He lists these as 
follows: (2) amount of nutrient ma- 
terial available for assimilation, in 
terms of weight or volume; (6) stimu- 
lation (primarily visual, but may also 
be olfactory or tactile) derived from 
the incentive prior to consummatory 
behavior; (c) amount and nature of 
consummatory activity 
and (d) stimulation from the incen- 


involved: 


tive during consummation (e.g., taste 
characteristics). 

Kling (1956) has further indicated 
that the third factor, consummatory 
activity, may be broken down into 
several which 
number of consummatory responses, 


subdimensions, are: 


duration of consummatory activity, 
ratio of consummatory to noncon- 


summatory goal 
area, and rate of consummatory re- 
sponding. 

Different varia- 
tion result from different operations. 
Thus, the two classical operations, 


responses in the 
combinations of 


manipulation of weight and number, 
each involve simultaneous variation 
along all four dimensions. Manipula- 
tion of duration of incentive ex 
typically involves ch 


posure 
inges in amount 
of nutrient material and consumma- 
tory activity. The 
sugar 
comitant 
nutrient material and consummatory 
stimulation. And the fifth operation 
may lead te concurrent variation in 
amount of activity 
and stimulation during that activity. 
The consideration of differences in ex- 
perimental manipulations will be 
central to the discussion of mecha- 
nisms of reir forcement. 

A second category of methodologi- 
cal differences is related to another 
question raised above. This concerns 


1anipulation of 


concentration involves con- 


variation in amount of 


consummatory 
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the number of incentive magnitudes 
a given S experiences. Lawson (1957) 
calls the method whereby each S ex- 
periences only one value the “‘abso- 
lute’’ method, and the method 
whereby each S experiences more 
than value the ‘‘differential”’ 
method. For the most part, studies 
utilizing the differential method will 


one 


be considered in a separate section. 

To answer the question of whether 
variations in incentive magnitude 
affect learning or performance, a 
special two-phase experimental de- 
sign will be required. This will be 
outlined in the appropriate section. 
However, device for 
between learning and 
which will be 
adopted in this paper, requires com- 


another dis- 
tinguishing 

performance efiects, 
This is the use of meas- 
involving 


ment now. 


ures time (e.g., latency, 
running time, speed of response, rate 
measures of 


of responding) as per- 


formance, and the use of time-inde- 
pendent measures (e.g., errors, trials 
to criterion) as measures of learning. 
Although there are perhaps no a 
priori grounds for making this as- 
signment, the review of the litera- 
ture which follows should bear out 
its worth. 

It will be noted that certain topics 
are being omitted which might seem 
relevant to the 

For example, theoretical 
is are minimized. This 
that the writer feels 
them to be unimportant or unsuccess- 
ful. Rather, the aim of the present 
paper is to attempt to order the em- 


" 


pirical evidence regarding incentive 
tude, so that the theorist will 
have a clearer picture of the data 
with which he will work. That is, 
the paper will attempt an empirical, 
rather than a theoretical, integration 
Theoretical discussions will be found 
in most of the papers to be reviewed, 
especially the following: 


issues discussed in 
this paper 
leratior 


col ¢ t 


is not to say 


Magni 


Crespi 
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(1944); Hull (1943, pp. 124-134; 
1952, pp. 140-148); Meyer (1951); 
Pereboom (1957b); Reynolds (1949 
and Spence (1956, pp. 127-148) 

Also omitted latent learning 
the Blodgett type (e.g., 
1929; Tolman & 
hey are espec ially pertinent 


are 
studies of 
Blodgett, 
1930). 


Honzig, 


to the present paper, as it is possible 


to interpret them as studies invols 
ing a change in incentive magnitude, 
a minimal amount to a larger 
W he i Bt) 


preted, their results agree quite well 
with the results of 


from 


specihable amount inter- 


studies to be re- 


' 


Howeve r, the latent 


have been 


viewed 
ing experiments 
qu itelv reviewed elsewhere 


of times (e.g Phistlethwaite 


studies in which reu 


Finally, 


ment is administered other 
peripherally (e.g 


by stomach fistula) are e 


ints 


other words, this review will be re- 


stricted to studies in which 1 


ire administered 


centives 
peripherally, and 
involving 


tors 


exteroce] 


RATE OF LEARNING AND ASYMP- 
TOTIC PERFORMANCI 


I he evidence to be reported in th 
section will be based on situations i 
which 


inimais are scries 


rewarded traini iw ider one 
several quantit f reinforceme: 
ind measurements are made of tet 
minal performance level 

approach to this level. I 

there is not sufficiet 

cases that these 
tually represent 
formance. Nevertheless 

ure ol periormance overt the fir 
training trials is usually giver 


Many 


differences in rate 


investigators have taken 


proar h to as- 


ol apy 


ymptotic performance level as a func- 
lawnitude to reflect 
, and dif- 


ferent asymptotes to reflect a differ 


tion of incentive 1 


an effect on rate of learning 
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performance While the 
former interpretation may be legiti- 
mate, the latter definitely is not. A 
higher terminal level could be an 
indication of a greater amount 
learned, or that amount of learning is 
ime but performance is superior. 

Relevant manipulations have in- 
cluded variations in _ incentive 
smount by all five of the operations 
outlined in the introduction. These 
will be considered in turn, and, unless 
otherwise the absolute 
vethod of incentive presentation was 


used. 


ence in 


, 
thes 


stated, 


‘artation in 


Weight and Number of 


the first 
intitative variation Grind- 
reported in 1929. He trained 
groups of chicks to run down a 
unway to either 0, 1, 2, 4, or 6 grains 
of boiled rice. When reciprocal run- 
ng times on the last five of seven 


Apparently study of 


was 


trials were plotted against the num- 
ber of grains of rice, an increasing, 
negatively accelerated curve was ob- 
ined. It is uncertain just what this 
represents, however. That it 
represents asymptotic performance 
with so few trials is questionable. 
Wolfe and Kaplon (1941) also 
used chicks as Ss. Three groups were 
in successively on each of three 


probiems, 


curve 


a runway, detour problem, 
and T maze. Running times through- 
out the 25 or 35 trials given on each 
problem fell in the following order, 
shortest to longest, for the three in- 

ntive amounts used: four quarter- 
grains of popcorn, one full grain, and 
one quarter-grain. Critical ratios on 
the final training days (consisting of 
five trials) indicated no significant 
differences between groups, but sev- 
eral approaching significance in com- 
parisons involving the one quarter- 
grain group. Inspection of the Wolfe 
and Kaplon curves indicates a similar 
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rate of approach to the asymptotes 
among all groups on all problems. 

Crespi (1942) gave runway train- 
ing to various groups of rats, with 
the number of incentive units given 
as reward varied in logarithmic steps. 
Each incentive unit weighed .02-gm. 
and the numbers of units used were 
1, 4, 16, 64, and 256. Asymptotic 
running speeds (Trials 21-25 in one 
experiment, Trials 16-20 in another) 
were an approximately logarithmic 
function of the number of incentive 
units. On the other hand, rate of ap- 
proach to these asymptotes was ap- 
proximately constant for the various 
incentive values These findings, 
that the amount of incentive affects 
performance at asymptote but not 
rate of approach to the asymptote, 
have with few exceptions been con- 
firmed in later studies using the ab- 
solute method of incentive presenta- 
tion. 

Continuing the runway 
studies, Zeaman (1949) y the 
weight of single incentive units as 
follows: .05-, .20-, .40-, .80-, 1.60-, 
and 2.40-gm. Eighteen or 19 daily 
trials were given, and equations were 
fitted describing the decrease in log 
latency over successive trials, treat- 
ing Trials 14-19 as 
Equation constants representing as- 
ymptotic performance differed sig- 
nificantly from while 
those representing rate of approach, 
or the slope constants, were nearly 
identical. 
clear: Quantitative variation affects 
terminal level of performance but 
not rate of approach to this level. A 
plot of asymptotic log latency as a 
function of log grams of food yields a 
decreasing function with slight posi- 
tive acceleration over the range in- 
vestigated. 

Additional runway studies have 
corroborated these findings. Among 


with 


iried 


asymptotic. 
each 


other, 


Conclusions are again 


these are the studies of Lawrence and 
Miller (1947), Metzger, Cotton, and 
Lewis (1957), and Spence (1956, pp. 
130-132). 

Two other runway studies require 
brief comment. Pereboom and Craw- 
ford (1958) measured both forward 
running time and competing response 
time over 40 trials, and found that 
incentive magnitude affects both var- 
iables, the latter more so. These re- 
sults suggest that a good deal of the 
“learning’’ that is shown by a de 
crease in running time (or changes in 
related time measures) may actually 
reflect the elimination of competing 
response tenden¢ 1€s, such as explora 
the like, rather than a 
marked decrease in forward running 


tion, and 
time 

1941 
rate of acquisition and a decrease in 
terminal log latency for eight train- 


Gagné found an increase in 


ing trials, as a function of incentive 


amount. However, for purposes of 


study, incentive amount 
with 
the longer 


with 


this was 


inter- 


completely confounded 
that 


assoc iated 


trial interval such 


rest intervals were 


larger amounts. The faster learning 
rate for larger amounts may then be 
due to the distribution of practice 


effect, and 


not to the amount of in- 
centive received 

A study by Hutt (1954) utilizing 
the bar confirms 
the runway finding that asymptoti 
performance is positively related to 
Hutt varied 
id quality of incen- 
issessed their ef- 
fects on rate of responding under PR 
The three quantities were 3-, 12-, and 
50-mgm., manipulated by varying 
the size of the food dipper, and the 
three qualities were a basic diet plus 
saccharine preferred), 
diet alone, and basic diet plus citric 
acid (least preferred). The animals 


pressing response 


incentive magnitude. 


both quantity ar 


tive factorially and 


(most basic 
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had first received training under con 
tinuous reward, and cumulative re- 
under PR are 


sponse curves essen- 


tially linear Thus it can be con 


cluded that the 


effects of both quantity and quality 


obtained significant 


represent differences in asymptoti 


performance [he differences were 


in the expected direction in both 
cases, 

Several 
more cor ipley 
discriminatior 
len .) Rev 
groups ol rats on 
crimination with 
unit weighing ei 
found th 
sponse times 


fered 


trials to criterior 


and 
signin 


means being withi 
other. Other invest 
Schrier, 1956a 


discrimination Ie 


1955; 
stantial ayreet 

In another 
(1950a comt 
le T i" 


number 


simy 
ot tr 

unit weighing ¢ 
He found 


response . 


i rea 


lor the 


1953 
multipl 
four levels 
Amount 
ellect on ¢ 


nis per 


errors or | 


(running time 
1951) 


and groups of rats ur 


employed ‘ hive-ul maze 
der either high 
or low thirst drive. The high drive 
Ss drank significantly more water in 


the goal box than did the low drive 
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animals and thus received larger re- 
ward. However, the two groups 
ailed to differ in terms of trials or 
errors to criterion, or times per trial. 


Variation in Duration of Incentive Ex- 


; 
f 


osure 
Kling (1956) employed a runway, 
thirst drive, and water as incen- 
Amount of water incentive was 
inipulated factorially with two 
levels each of duration of exposure to 
the drinking tube (15- and 120-eec.) 
ind drinking and 
m.). Several consummatory re- 


tube diameter (2 


nse measures were obtained and 
ing speeds over the last three of 
iily trials were found to be un- 
ystematically related to the volume 
water consumed per trial, time 
ver trial actually spent drinking, and 
ion of goal box time spent 

king. They were positively re- 
however, to ingestion rate, a 

of consummatory activity. 
In a similarly designed study, Hellyer 


1953) independently varied amount 


measure 


of water reinforcement and drinking 
tube size, and found that both vari- 


ibles affected runway latencies 


There was an inverse correlation be- 


tween duration of consummatory re- 
ponse and laten y. 

Fehrer (1956) carried out two ex- 
periments, one with a U maze, the 
other with a runway, using the thirst 
drive and varying both time in the 
box amount of incentive. 
Running speeds were found to vary 
systematically with 


goal and 
neither amount 
of incentive (40-sec. drinking vs. 10- 
sec. drinking followed by an addi- 
tional 30 sec. in the now empty goal 
box) nor time in the goal box (10-sec. 
drinking vs.. 10-sec. drinking fol- 
lowed by 30-sec. delay in the goal 
box 

Spence (1956) has pointed out that 
in the majority of the studies of the 
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type in which weight or number of 
incentive units are manipulated, du- 
ration of time spent in the goal box 
has been with magni- 
tude, Ss typically being allowed to 
remain in the goal box until, and 
only until, the incentive has been 
consumed (the Fehrer study, just 
described, is an exception to this). 
Two of his and 
Czeh, varied magnitude and dura- 
tion independently by allowing rats 
with larger amounts time in the goal 
box equivalent to that of animals 
with smaller amounts, but then al- 
lowing them to finish eating 
where. These two studies, one meas- 
uring bar-pressing the 
other, runway starting speeds (re- 
ported on pp. 138-141) both pro- 
duced evidence that performance 
varies as a function of duration in the 
goal box and not amount con- 
sumed. Thus, they with 
Kling and Fehrer in the finding that 


confounded 


Swisher 


students, 


else- 


latencies, 


agree 


performance does not vary with the 


actual amount consumed, but dis- 
agree with Fehrer’s finding that per- 
formance did with 


tion either. 


not vary dura- 


Variation in Concentration of 


and Saccharine Solutions 


Studies in which variation in in- 
centive magnitude is 
manipulating the concentration of 
sugar or saccharine solutions provide 
some apparent exceptions to the gen- 
eralizations that rate of learning is 
independent of quantity of reinforcing 
agent, and asymptotic perlormance 
is a monotonic function of quantity. 

The first investigator to vary con- 
centration of sugar 
Guttman (1953). Different groups 
of rats were presented 4%, 8%, 16%, 
or 32% sucrose rein- 
forcement for bar pressing, and the 
effect was measured on response rate 


achieved by 


solutions was 


solutions as 


during conditioning, extinction, and 
PR, and on time for original condi- 
tioning. The measures obtained dur- 
ing original conditioning are of im- 
mediate Asymptotic re- 
sponse rate, and rate of approach to 
this asymptote were found to vary 
with The 
between approach and con- 


concern. 


concentration. 
rate ol 
centration 


relation 
was monotonically in- 
the asymptotic level! 
increased up to 16% and then fell 
off at 32°. In addition it was found 
that the time required to emit 500 
responses during acquisition was a 
function of 
The two measures of acquisi- 


creasing, but 


decreasing concentra- 
tion. 
tion rate yield results seemingly in- 
consistent with findings previously 
reviewed. The 32% group clearly at- 
tained its terminal response rate with 
fewer reinforcements than did the 
But the asymptote 
for the 32% group is lower than that 
for either the 8% or the 16% groups 
Guttman that this lower 
asymptote might be due to uncon- 
trolled drinking behavior in competi 
If we are to 
reconcile the finding of faster acquisi- 


other groups. 


suggests 


tion with bar pressing. 
tion rate with increased quantity of 
incentive in this study, with the re- 
sults which failed to find such a rela- 
tionship, one possibility is to give it 
the status of an artifact. The non 
monotonic asymptote is, as indicated, 
likely an artifact, and this asymptotic 
function and the different approach 
rates are probably interrelated. If 
the 32% group has less distance to go 
to reach asymptote, it will get there 
sooner. 

In two studies, Young and Shuford 
(1954, 1955) investigated the effect of 
sucrose concentration on a running 
In the first study (1954), 
latency and running time were meas- 
ured where concentrations were 2%, 
6%, 18%, and 54%. By the end of 


response. 
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18 daily trials the groups had ordered 
themselves in latencies in 
the order indicated, greatest to least. 
But asymptotic running speeds were 
the same for all groups. Finally, these 
asymptotic 


terms ol 


speeds were ac hieved 
sooner the higher the concentration, 


rhis 


finding is possibly artifactual also, 


a finding paralleling Guttman’s 


because, on the initial trials, Ss with 
higher concentrations were running 
faster (all Ss had had preliminary 
training with the appropriate con- 
centration, and this can account for 
the initial gradient), and hence pre- 
sumably they had less far to go to 
In the 


tration 


reach the common asyn ptote 


second study (1955), concen 


was again varied in logarithmic stey 


from 2% to 54%, 


— 


ind 
were given Terminal 
were nonmonotonk lly 
concentration 
Hughes (1957 


ion and 


, ' 
soiution, and 


varied I 
| 
volurne 
cl irine isscss 


effect on latenc’ running 


percentage ol 
For 


wi maze 
ol corre 


*‘ntage 


correct respo 
Trials 2—40, 
t responses | 
nificantly with both 
il latency 


time in 


iprox 
running 
with volume but n oO ntration 


ior t he t ntave 


Finally, | correct 


response measure, there was a sif 


nificant interaction between volume 
such that for the 


1 


smaller of two volumes the effe 


and concentration, 
concentration wv mnotonic, 
the 
concentration nonmonotonk 
Finally, Smith and Duffy (1957 
reported faster learning of a T-maze 
habit when the reward was 4 cc. 20% 
than 
terms of increases in percentage cor- 


for the larger volume eflect 


was 


sucrose when only .1 cc., in 
rect, and decreases in running time 


Here we have another possible ex- 
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ception to the generalization that in- 
magnitude does not affect 
learning. However, their 
measures of rate of learning were 
somewhat unorthodox and no direct 
statistical comparisons in terms of 
volume were made. In the interests 
of parsimony, and until this desigi is 
repeated, preference is to maintain 
the original conclusion that learning 
rate is independent of incentive mag- 


nitude. 


centive 
rate of 


Zero Incentive Magnitude 


A final set of three papers are re- 
lated in that, in each, one of the mag- 
nitudes employed was a zero mag- 
nitude. Furchtgott and Rubin (1953) 
ised a two-unit, linear T maze and 
assigned different groups of rats to a 
single incentive unit weighing either 
0, 20, 75, 250, or 2500 mgm. No dif- 
three measures 
earning (trial to criterion, 
er of Ss attaining this criterion, 
ber of errors during the precri- 
were found except that 
receiving any positive 
nt did better than the group re- 
ng zero amount. However, mean 
ng speeds did differentiate the 
amount groups, the two 
irger reward groups running faster 
than the two smaller reward groups. 

lwo other studies support the find- 

1 positive amount leads to 
“arning than zero amount. 
Shea, and Elkind (1958) 
significant interaction be- 
tween incentive amount (a 1-gm. or 
pellet vs. no pellet) and 
trials, indicating a possible effect on 
rate of learning, but there was no 
mprovement at all when the goal 
box was empty. Smith and Kinney 
(1956) found faster bar pressing for a 
reward of 20% sucrose solution than 
for 0% (plain water), during a single 
26-min. session. 


I 
~~ 
/ 


lerences i! 


n terms ol 
of rate of | 
} 


period 


KrOUpS 
positive 


that 
faster | 
Seward, 
found a 


a .5-gm. 
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LEARNING VERSUS PERFORMANCE 
EFFECTS 

The tentative conclusions arrived 
at in the preceding section are that 
incentive magnitude does not affect 
rate of learning but that it does af- 
fect asymptotic performance. The 
present section will be concerned 
with whether this difference in as- 
ymptotic performance reflects a dif- 
ference in amount learned, or in level 
of performance independent of learn- 
ing. The experimental differentiation 
of learning and performance effects 
requires a. two-phase experimental 
design, comprising a training and a 
test phase. Training-phase per- 
formance preferably should be as- 
ymptotic before the test phase is in- 
troduced. 
the training phase would not, as we 
have noted, 


Measures obtained during 


differentiate learning 
and performance effects, as either 
could be and, if so, their 
measurements would be confounded. 


present, 


However, where incentive magnitude 
is varied during the training phase 
but held otherwise 
equated) during the test phase, dif- 
ferences in performance during the 
test phase would reflect differences 
in strength of learning. 
where incentive magnitude is held 
constant during the training phase 
but varied during the test phase, test- 
phase performance would reflect ef- 
fects of contemporary incentive mag- 
nitude and hence a direct effect on 
performance 


constant (or 


In contrast, 


These interpretations 
rest on the assumption that learning 
effects should be relatively perma- 
nent and carry over from one phase 
to the next, while performance effects 
should not (Lewis & Cotton, 1957: 
Maher & Wickens, 1954). 

An implication of these statements 
is that, if magnitude affects learning, 
the approach to the new asymptote 
under test-phase magnitude (where 
there has been a change in magnitude 


in going from one phase to the next) 
should be gradual, as it was in the 
training phase. But if magnitude af- 
fects momentary performance, the 
approach to the new asymptote 
might be either gradual or abrupt. 
The most compelling evidence that 
magnitude affects performance inde- 
pendently of learning would come 
from findings of an abrupt change in 
performance level. This is not a nec- 
essary requirement however, for it is 
possible that. competing response 
tendencies might prevent sudden per- 
formance _ shifts Pereboom: 
1957a, 1957b 


tive 


(see 
In contrast, if incen- 
affects amount 
learning is not 
strued as a one-trial affair, then the 
effects of change in magnitude must 
include a gradual change in level of 
performance. 


magnitude 


learned, and con- 


Variation in Weight and Number of 
Incentive Units 


The earliest study to use training 
and test phases, as here defined, was 
Crespi's (1942). In one of his experi- 
after rats trained with either 
had 
reached stable asymptotes their mag- 
nitudes were both shifted up to 16 
units 


ments, 


one or four incentive units 


In another experiment, 64- 
256-unit were shifted 
In all but the 1-to- 
running speed shifts 
were abrupt such that after but one 
the incentive 
amount, the running speed of the 
reached the level of Ss 
trained with 16 units from the start 
In the 1-to-16 this level was 
reached after two exposures. Addi- 


and groups 


down to 16 units 


16 condition. 
exposure to new 


animals 


case 


tional training at these new values 


led the animals to “overshoot” in 
their performance levels. That is, Ss 
with an increase in incentive magni- 
tude ran faster, while those with a 
decrease ran more slowly, than those 
trained with 16 units from the start 
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Both of these overshooting effects 
were significantly different from ob- 
tained or extrapolated values for the 
16-unit rats, and were labeled 


tion” 


“ela 
and “depression” effects, re- 
spectively. Later investigators have 
dubbed 
and ‘‘negative contrast” effects 
ott (1928) had earlier found 
similar to the negative contrast effect 
1 multiple T 


' nti 
incen ve 


contrast” 
Elli 


in effect 


them ‘positive 


when rats trained in 


with bran mash as 
shifted to sul 


Crespi’s results indicate that the ef 


maze 
were flower seed 
fect of change in incentive magnitude 
is one of sudden change in periorn 

ance, and thus suggest that incentive 
magnitude affects level of 


pertort 
ance rather than 

Zeaman (1949) also st 
fect of sudden shifts in ine 
nitude 


amount 


upon runway b 


Changes in latency were 

curring after but one exposure to the 
new incentive values. In one experi 
ment, the change in later 

rec tly proportior il to the « 

amount, and in another, based .upon 
extrapolation, a positive contrast ef 
fect obtained, but a 


negative contrast effect was 


possible 


t 


not sig- 


was 


nihcant. 
Another study from which infor- 
mation based on abruptness of per- 
formance change can be obtained is 
that of Spence (1956, p 130-13, 
Following 48 runway ttrials with 
either .05- or 1.0-gn 


values 


these 
As iin, 


were 


1. incentive, 


were interchanged 


changes in strength 


response 
instantaneous A significant nega 
was obt uned, but 


tive contrast eflect 


not a positive contrast eflect 

Metzger, Cotton, and Lewis (1957 
employed a factorial design in which 
four groups of rats received either 
two or eight 45 
the first 


trials. Speed of running during the 


mgm. pellets during 


and second 10 runway 


second (test-) phase varied only as a 
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function of incentive magnitude dur- 
ing that phase. In assessing contrast 
effects, this study has the obvious ad- 
vantage over the others that com- 
parisons need not be made on the 
basis of extrapolation to what per- 
formance might have been had incen- 
tive shifts not occurred, for, accord- 
ing to the design, only half of the Ss 
experienced a incentive 
Although changes in run- 
ning time occurred within one or two 
trials following incentive shift, there 

is no evidence for either positive 

negative contrast effects. 

It is clear from these studies that 
variation in 


change in 
imount 


quantitative incentive 
iffects level of performance rather 
than amount learned. But how are 
the contrast effects to be interpre ted? 
Crespi and Zeaman gave their Ss 18 
20 training trials before shifting in- 


; 


centive values; Spence gave his Ss 
48 preshift trials. Both Crespi and 
Zeaman found positive contrast ef- 
fects but Spence did not. 
argues that those found by Crespi 
id Zeaman were artifacts, due to 
the possibility that their rats had 
not had enough trials to reach a true 
before the shifts were 
His evidence, reported 
ibove, supports this interpretation. 


Spence 


asymptote 


introduced. 


In other words, perhaps performance 


continued 
without 


would have to improve 
the increase in in- 
centive magnitude. However, that 
Metzger et al. gave only 10 preshift 
trials, fewer than any of the other ex- 
perimenters, and failed to demon- 
strate positive contrast effects, ar- 
Out 
of four studies (excluding the Elliott 
study two demonstrated each 
of effect, and the one study 
which maintained control Ss _ not 
shifted yielded negative results for 
both types of effect. Before the mat- 
ter is resolved 


inyway, 


gues against Spence’s position. 


only 


type 


more information is 
needed on the conditions under which 
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contrast effects would and would not 
be expected to appear. Pereboom 
(1957b) has suggested that runway 
familiarity and competing explora- 
tory behavior are important variables 
in this connection. Two related pos- 
sibilities suggest First 
it should be noted that both Crespi 
and Metzger et al. employed meas- 
ures based on running time and that 
the runway used by Crespi, who ob- 
tained positive 
times as long as the one used by 
Metzger et al., who obtained negative 
results. Alley length may be an im- 
portant variable. 
the extent of preshift training, if ani- 


themselves 


results, was five 


Second, regarding 
mals are given sufficient trials to 
bring them to the true ‘physiological 
limit,”’ it would be difficult if not im- 
possible for low-to-high Ss to exceed 
in performance animals 
the higher amount. 

One study involving the 


ilready on 
training- 
test paradigm used a learning situa- 
tion more complex than the runway. 
Maher and Wickens (1954 
test situation 
under a drive different from 
used in Their were 
given 20 trials in a 14-unit multiple 
T maze with 22-hr. food de; 
and either one or five pellets as in- 
centive. No 

acquisition, in terms of 
found, but 
19-20) were greater in 


used a 
requiring relearning 
the one 


training. rats 


rivation 
differences in rate of 
errors, were 
(Trials 


spec ds 


the fiv 


terminal 
elaine 
pellet 


= 
group, findings consistent with those 


prece | ig 
Following a lapse of 24 


reviewed in the 


section 
mot ths, the 
maze relearned under 22-hrs 
water deprivation 
amount of water reward. During this 
relearning test, neither time nor errors, 
as a function of training-phase mag- 


nitude, were significantly different. 


was 


and it constant 


‘ariation in Concentration of 
Solutions 


Sugar 


Following conditioning, extinction, 
and reconditioning, Guttman’s (1953) 
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rats were tested under 1-min. PR for 
five daily 4-hr. sessions. Rate of re- 
increasing 
rithmic function of concentration. 
Note that in the 
phase involved, not a shift in incen 
tive magnitude, but a shift from con- 
reinforcement to PR, and 
that the relation between magnitude 


sponding was an loga- 


his study, test 


tinuous 


and performance changed from non- 
monotonic to monotonic. 

Young and Shuford (1954) shifted 
the concentrations of their rats fol- 
lowing the 18 conditioning trials re 
ported ibove. The general effect was 
a decrease in total time (latency plus 


runnin time) when concentration 


Was increased and an increase in 
concentration was 
eliect 
the 


was attributed to 


when 
Lhe 
iced 


and this difference 


total time 
eed 


ronoul 


was 


latter 


decre former 


more than 
a practice eftiect 

Dufort 
training 
with a 109% 


five bottle « 


and Kimble (1956) gave 


rats 20 trials in a runway 
sugar solution in one of 
ips located on the goal 
platform. Following. this, they were 
split and continued under either 0% 
extinction), 5%, 10%, or 20% con- 

additional 40 trials 


incentive con 


centration tor an 
Following the shift in 
centration, the 20% group changed 
7 


its percent 


[ correct 


the 
changed more gradually. 
and 10% 


prove 


ize responses 
other groups 
The 20°, 


continued to im 


abruptly, while 
groups 
the latter case 
had 


not been achieved before differential 


ndi iting in 


that asymptotic performance 


train 
100° 


was introduc ed), both reach 


ing o corres by the end of train 


{ 
the 5° and 0% 


ing, hile 

| Dif- 
ferences in reciprocal running times 
and ag the 
change in performance of the 20% 
group was abrupt while the other 
changes were gradual. Terminal run- 
ning speeds were a positive function 
of concentration, and the rate of ap- 


groups 
declined in percentage correct. 


were also assessed, un 
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proach to these speeds was the same 
in all cases except for the 20% group, 
whose slope constant was wgreater 
than that of the other groups. This 
be due to the fact that 
had not 
been achieved before differential con- 
Dufort 
the in 


is that the effect of changing 


difference may 
asymptotic performance 
centrations were introduced 
and Kimble conclude, 
dication 
the 


least 


amount ol reiniorcement 1s at 


partly on performance rather 
than habit’’ p 190). We 
that the etiect 18 


on perlormance They 


would con- 
lude exclusively 
also suggest, 
on the basis o ne near; 
the 5% and 

tion be concep 

case of decrea 


centive. 


magnitude on resi 
ight profit 
the extinctior 


periorman 


o! extinction 
j 


ever, aoes ti 


] 


, ’ 
io an 


Three studi 

easure of tri 
tinction criterion 
effects of prior i ntive magnitude 
while one found a systematic effect 
Thus, Miller 1947 
found difference in 


Lawren c and 


an insignificant 
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trials to running response extinction 
criteria of either 3- or 5-min. latency 
as a function of prior amount (one or 
four pellets). Reynolds (1950b) ad- 
ministered rewards to three groups 
of rats for 25 consecutive bar-pressing 
responses. The three groups received 
one 60-mgm. pellet, two such pellets, 
or one 160-mgm. pellet for each re- 
No significant differences 
between any of the 
s in terms of responses to a no- 
In a re- 
study, Reynolds, Marx, and 
(1952) obtained no sig- 
trials to ex- 
a bar-pressing response 


ponse 


were obtained 


grou 


response criterion of 5 min. 


ated 
Henderson 
int differences in 
n of 
following 120 vs. 30 mgm. reward 
Young and Shuford (1955) extin- 
hed their Ss with distilled 
following the 25 training trials previ- 
The slope of the ex- 
of running speeds 
ied directly with prior concentra- 
Nevertheless, Ss with lower 
ntrations reached an extinction 
than with 
} concentrations, perhaps be- 
cause they had less far to go to reach 


ctio 


water 


sly mentioned 


nction curves 


sooner those 


this criterion. 


Performance over a Constant Number 
linciion Tria 
Ze ill in 1949 


various experiments to extinction 


subjected the Ss of 


he running response after they 
ompleted their rewarded train- 
or without test- 
ses intervening between training 
tion. In all three of 
the effect of 
magnitude was to alter the rate of ap- 


ther 
Ciciel 


with 


} 


1 extimn his 


experiments, previous 
h to a final common perform- 
Gagné (1941) reported a 
inal extinction level (after 


proa 
nce le vel 
higher tert 
only five extinction trials) for greater 
reward ar! confounded with 
intertrial intervals. Metzger 
(1957) extinguished their Ss 


\ounts, 
lor yer 
et al 
after the training and test phases con- 


sidered earlier. Using running times 
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during Extinction Trials 2-11, or Ex- 
tinction Trial 2 alone, training-phase 
magnitude had no effect. However, 
test-phase magnitude had a signifi- 
cant effect on both measures, greater 
resistance following the larger 
amount. 

Following PR under three levels 
each of quantity and quality, Hutt's 
(1954) rats were given two }4-hr. ex- 
tinction sessions. 
extinction function of 
both variables, more responses being 
emitted following PR with 
amounts and preferred qualities. 
Guttman (1953) found rate of re- 
sponding during an initial 5-min. of 
extinction to be an increasing 
tonic function of the concentration 
of sucrose used during conditioning. 

Fehrer (1956) found greater re- 
sistance to extinction (in terms of 
both running speeds over a constant 
number of trials, and trials to various 
criteria) following the condition of 
10-sec. drinking followed by 30-sec. 
postreinforcement delay in the goal 
box, than following either 10-sec. or 
40-sec. drinking time in the goal box. 

According to the assumption that 
learning effects are relatively per- 
manent while performance effects are 
momentary, we would be led to con- 
clude, on the basis of extinction per- 
formance differences found for a con- 
stant number of trials, that magni- 
tude of reward does affect learning 
when the absolute method is used. 
However, it might be that the higher 
performance level during extinction 
following larger amounts is due to 
the fact that terminal acquisition 
level is higher the greater the amount 
of incentive. In fact this appears to 
be the case. Metzger et al. (1957) re- 
ported results of an analysis of co- 
variance upon their extinction scores, 
where performance levels at the be- 
ginning of extinction were equated 


Responses during 
varied as a 


larger 


mono- 
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in terms of running times during the 
five reinforced trials. Under 
the effect of 
test-phase amount on extinction dis- 


last 


these circumstances, 


They conclude, ‘‘.. . re- 
ward affects performance on extinc- 
tion through differential levels of 
performance just prior to extinction 
rather than affecting performance on 
extinction directly” (p. 188). If we 
this interpretation to the 
other studies in which performance 


appeared. 


apply 
was measured over a constant num- 
ber of extinction trials, the conclusion 
that 

aflect 
tained 


also 


magnitude does not 


amount ol 


incentive 
learning is main- 
Chis interpretation might 
the 
results of 


1955 


account tor apparently 
Young and 


ind Fehrer (1956) on 


anomolous 
Shuford 


trials to extinction criterion. 


INTERACTION OF INCENTIVE 
MAGNITUDE WITH OTHER 
VARIABLES 

Several investigators have manip- 
ulated 
ally 


magnitude factori- 


While 


incentive 


with other variables 


their results agree with previous con- 


clusions, they also provide informa- 
tion on possible interactions of this 
variable with 


other variables In 


parti ul 7. studies have allowed the 


assessment of interactions of incen- 


tive magnitude with quality of re- 
ward, drive level, and partial rein- 


forcement 
Quality of Reward 

Hutt (1954) found that the effects 
of quantity and quality of reward on 
rate of bar pressing during both PR 
and extinction were independent of 
each other, the interaction F ratio in 
the former case being less than one. 
Drive Level 


Seward, Shea, and Elkind (1958) 
factorially varied incentive magni- 
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tude and length of food deprivation, 
and assessed their effect on running 
Both main 
nificant, as was the interaction be- 
tween them. 


speed. effects were si 


‘ 


But this interaction is 
based on comparisons involving zero 


reward amount and zero hours dep- 


rivation. No learning took place 


under satiation or when the goal box 
was empty That the 


interaction 


might represent a special case found 


only when values of either incentive 
magnitude or | 


‘ 


are zero, as suggested | Se 


] 


al., 1s attested recent 


Reynolds and Pavlik (1958 
varied incentive 

and 2.0 gm riva 
(3, 22, and 44 hours) factor 
: . 


72 runway triais and repor 


ciprocal latenc tI 


es overt 


these trials Differences 


tion of both incentive amount an 


were in 


| 
| 
i 
I 


deprivation expecter 


direction and 
the 
variables was 


1.00 


However 
interact 


Reynolds et | 
reported a signil 
tween amount ot in 
level on trials to 


rion ol 5-min MA th no pre 
suc h that high dr ve—high re 
low drive-lov 
tinguished m 
drive-low rev 
reward animal 

lik suggest that this differe: 
be a function of the differer 
measures employed i 
Another possibility 

For Reynold 


lik, drive was defined in ter 


tion of drive. 


length of food deprivation. In 
trast, Reynolds et al. mar 
both fed and deprivatio 
time simultaneously. In 
would seem that 


pulate 
amount 
any event, it 


more research is 
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needed to specify the conditions of 
both drive and reward under which 
an interaction between them would 
be expected. 


Partial Reinforcement 


Hulse (1958) found a greater run- 
ning speed over the last nine of 25 
daily runway trials when the incen- 
tive was a 1.0 gm. food pellet than 
when it was a .08 gm. pellet, and 
when reinforcement was continuous 
than when appearing on only 46% of 
th He also measured running 
speeds over a constant number of ex- 
tinction trials and found a significant 
between the 

of reinforcement 
reinforcement 
irtial 


e trials 


teraction effects of 


prior 


percentage 
d amount of 


it, with | 


such 
reinforcement, 
to extinction 


resistance was greater 
ng a larger amount, but with 


reinforcement, the _ re- 


followi 


nftir 7 
Col UOUS 


verse effect obtained. 


MECHANISMS OF REINFORCEMENT 


In the introduction, four possible 
echani quantitative varia- 
were outlined: amount of nutri- 
available for assimila- 
preconsummatory stimulation, 
matory activity, and 
summatory stimulation. It was 
indicated that different ex- 
perimental operations produce dif- 
ferent combinations of variation 
rhus, regarding mecha- 
nisms of reinforcement must involve 
comparisons of experiments utilizing 
the different operations. 
We are actually concerned here 
th the conditions of reinforcement, 
| a distinction must be made be- 
necessary and sufficient condi- 
If quantitative variation is 
to affect behavior when one 


sms of 


nt material 


con- 


further 


de« isions 


ions 


show nh 


of the mechanisms is held constant, 


this would indicate that variation in 
that mechanism is not necessary for 
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incentive magnitude effects. On the 
other hand, lack of behavioral varia- 
tion would indicate that that mecha- 
nism is necessary. If only a single 
mechanism is varied and this pro- 
duces behavioral variations, it may 
be concluded that variation in that 
mechanism is sufficient. But if there 
is no behavioral variation, it must be 
that that mechanism is not sufficient, 
or that a necessary variation (if there 
are any) is lacking. 
garding and 
must be reached through comparison 
of studies and a process of elimina- 
tion. Because experimental manipu- 
lations of weight and number con- 
found all four possible mechanisms, 
studies of this sort are not crucial to 
the problem. 


Conclusions re- 


necessity sufficiency 


Preconsummatory Stimulation 


That variations in preconst'nma- 
tory stimulation are not necessary is 
attested by the results of studies in 
which either duration of incentive ex- 
posure, or concentration of sugar or 
saccharine are manipulated. In all of 
these cases, preconsummatory stim- 
ulation is presumably constant, and 
yet behavioral variations, previously 
reviewed, have been found 

Two studies in which preconsum- 
matory visual stimulation was inde- 
pendently manipulated indicate that 
variation in this mechanism is not 
sufficient either. McKelvey (1956) 
factorially varied the duration of in- 
centive exposure (30 vs. 180 
and visual size of the incentive (food 
tray of 1-in. diameter vs. 2-in. di- 
ameter), during acquisition of a 
black-white Dura- 
tion of reward affected the perform- 
ance measure, running time, but not 
the acquisition measure, errors. But 
by neither measure was preconsum- 
matory stimulation shown to have 
any systematic effect on behavior. 

This finding is corroborated in a 


sec.) 


discrimination. 
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study of delayed response perform- 
ance by chimpanzees using a modi- 
fied differential method, by Cowles 
and Nissen (1937). In one condition 
of their experiment, the chimps were 
shown either a large piece of orange 
without skin or a small piece with 
skin (the skin differences were used 
to equate eating times), but after 
the delay they always found the 
smaller. In the other condition, the 
chimps found the size they had been 
shown prior to the delay. Thus, with 
the first 
tory stimulation is 


procedure, preconsumma- 
varied but the 
amount remains constant; with the 
second procedure the two factors are 
With the former pro- 


correct re- 


confounded. 
cedure, percentage of 
sponses, latencies, and response times 
did not differ systematically in terms 
of whether the pre delay stimulus was 


large or small, but with the latter 


procedure, differences were in the ex- 
pected direction and, for the most 


part, statistically significant. 


{mount of Nutrient Material 


A number of studies involving the 
use of, and variations in the magni- 
tude of, which do not 
alter the effects of deprivation indi- 
cate that variation in the amount of 


incentives 


nutrient material available for assim- 
ilation is nota necessary condition of 
reinforcement. Amount of nutrient 


is effectively controlled in 


material 
these studies, by elimination. 
The pioneer study with saccharine 


that of Sheffield and 


This was essentially a 


incentives is 
Roby (195( 
lemor nal ly, designed 
demonstrational study, designed to 
determine if saccharine could act ef- 
fectively as a Although 
three experiments were conducted, 
only the third will be described here. 


reinforcer. 


For this experiment, rats learned a 
position habit in a T maze. 
deprived Ss were given 42 trials with 
saccharine solution in one arm of the 


Food- 
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the other. 
significant 
the SAaCct | 


side, as well as an increase 


maze and water in 
There 


crease in 


tap 
developed a 
choices of 
ingestion and decrease in 
time. 
The study of Hughes 

cussed in an earlier section 
th it, 


ibsence of reir 


not only can rats 
forcement | 
material, but that perfo 
ferentially affected by 
concentration. 

Iwo studies using 


vated male rats as Ss 


females as incentive als¢ 


behavior may be modified 
sence of alterations in the 
deprivation Shefheld, 

Backer (1951) compa 

speeds when the incentive 
a female rat in heat or ar 
Even though ejaculation 
mitted, the Ss ran faster 
tive female than to tl 
tive. Kagan (1955) four 
percentage ol correct re 
faster running 
when the 


with ejaculation 


speed iT a 
incentive Was CO] 

than when only 
utted. Perfor: 


both of these conditions 


tromission was pern 
ance under 


was in turn superior to that wher 
only mounting was 
Several studies, already d 


indicate 


illowed 


that, when ir 
administered peripherally, varia 
in amount of nutrient material is n¢ 


a sufficient mechanism of reinforce 
Kling (1956 1 


ment. found fr 
speeds to be unrelated to vo 


water consumed per 
Fehrer (1956) also found runnin 
speeds to be independent of 
of water reinforcement. Swish 
Czeh (Spence, 1956) found 
ance to be unrelated to an 
food consumed. 

It should be emphasized that 
conclusion that 


nount 
and 
periorm 


ount ot! 


the 


nutrient material, 
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and hence that variations in amount 
of nutrient material is not a sufficient 
condition, applies only to situations 
in which incentives are administered 
peripherally. If we include evidence 
from experiments in which peripheral 
factors are by-passed, then the con- 
clusion must be that variation in nu- 
trient material is a sufficient condi- 
tion. See for example, the studies of 
Coppock and Chambers (1954), and 
Miller and Kessen (1952). But the 
remaining evidence will indicate that, 
when incentives are peripherally ad- 
ministered, the reinforcing 
is itself peripheral. 


mec ha- 
nism 


Amount of Consummatory Activity 


When 


ied by 


incentive magnitude is var- 

manipulating the concentra- 
tion of sucrose solutions, the amount 
of consummatory activity is presum- 
ably held constant. The studies re- 
viewed earlier indicate that perform- 
ance does vary as a function of su- 
crose concentration, so it would seem 
that variation in amount of consum- 
matory activity is not a necessary 
mechanism. 

However, variation in consumma- 
tory activity does seem to be a suffi- 
A number of studies 
in which ingestion rate was measured 
would so indicate. For example, Shef- 
field, Roby, and Campbell (1954 
found a high positive correlation be- 


cient condition. 


tween running speed and ingestion 
water, 
charine, dextrose, or dextrose plus 
saccharine solutions), 
that 


functio 


rate (incentives were sac- 
direct evidence 
performance is an increasing 
n of amount of consummatory 

The Kling (1956) study also 
indicates that the important consum- 


matory 


activity 


ictivity variable is ingestion 
rate 

Supporting evidence comes from 
the finding of Sheffield, Wulff, and 
Backer (1951) of a positive relation- 
ship between running speed and per- 
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centage of opportunities to attempt 
copulation during which the attempt 
was actually made. 


Amount of Consummatory Stimulation 


When duration of incentive ex- 
posure is manipulated, the nature 
of the consummatory stimulation is 
held constant (although its duration 
may vary). Several studies of this 
type (e.g., Kling, 1956; Spence, 1956) 
have found performance variations, 
suggesting that 
summatory stimulation are not nec- 
essary. 

That they are sufficient, however, is 
indicated in a study by Cockrell 
(1952), similar in design to Gutt- 
man’s (1953), but manipulating con- 
centration of saccharine rather than 
of sucrose solutions. By his pro- 
cedure, the only mechanism assumed 


Variations in con- 


to vary was consummatory stimula- 
tion (taste). During conditioning 
under continuous reinforcement, bar- 
pressing rates reached nonmonotoni 
asymptotes as a function of concen- 
tration. During extinction and PR, 
rate of responding was approximately 
a linear function of the logarithm of 
sact harine concentration. 

We may close this section with the 
tentative conclusion that none of the 
mechanisms of reinforcement are nec- 
essary for bringing out performance 
differences as a function of i 
magnitude, but that 
either amount of consummatory ac- 
tivity or stimulation associated with 
that activity are sufficient conditions. 
By way of qualification, it should be 
pointed out that there is some diffi- 


centive 
Variations in 


culty in manipulating consummatory 
stimulation while holding consumma- 
tory activity constant (cf. Guttman, 
1953). Thus it may be that only the 
concurrent variation in 
tory activity and consummatory 
stimulation will prove to be sufficient. 


consumma- 
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THE COMPARISON OF INCENTIVE 
MAGNITUDES: THE DIFFEREN- 
TIAL METHOD 


The term differential, as used here, 
will be somewhat broader 
meaning than that originally assigned 
by Lawson (1957). Several differen- 
tial procedures may be distinguished: 

1. Single Method 
Throughout the experiment there is 
but a single task or problem before 
the S, but in different stages of the 
experiment he 


given a 


Problem 


different 
amounts, for example, according to a 


recei ves 


latin square design. 

2. Simultaneous Discrimination 
Selective learning is involved 
that one of two responses is followed 
by a greater amount, the other by a 
lesser amount. S must learn to dis 
criminate magnitudes. 


such 


3. Successive Discrimination—On 
trials, S makes one re- 
sponse followed by a larger amount, 
or a second response followed by a 
smaller 


successive 


amount 


4. Successive Problems—aA series 
of problems are to be learned, each 
problem correlated with 

amount The “learning 
paradigm (Harlow, 1949) pro 
vides a familiar example. 


It will be 


a different 
incentive 


sets’ 


noted that in some cases 
the distinction between absolute and 
differential methods will appear arbi 
trary. Certainly the previously dis- 
cussed training test-ph ise design un- 
der the absolute method involves ex- 
posure by the same S to more than 
one il 


centive However, 


were 


amount. 
separate made 
under each phase of the experiment, 
and statistical 
rate. 


viewed in 


measurements 


analy ses were sepa 


In contrast, studies to be re 
this 


section are ones in 


which a single analysis is performed 
on data obtained under variable in 
centive conditions for the same S. 
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Single Problem Method 

Gantt (1938; also reported in Hull, 
1943, p. 125) presented a of 
asymptotic (? 


curve 
conditioned sal 
tion in a single dog as a fur 


centive magnitude ;, 1, 
grams of food. The curve 


an increasing, negatively 


function. It is interesti 
that this is apparently 
ol 


ture based upon the 


incentive magnit 


tioning procedure 
Nissen and 
both the size « 
the 
weight pieces 


on the limits of 


nana and 


an accuracy ol 
total number 
to 20, 


accom! 


and the 
effect 

trast effec ts Ww 

ond delayed respons 
& Nissen, 1937) | 
cited. They fe 
shorter latencies 


amount, 


eration 


small incentives 
Fletcher (194 

of incentive n 

bar ina 

pulling respor 

ind found 


tween trequenc' 


a pos 
centive length 
time measures 


ter ted, 
tency, pulling ti 


among 


of tugs per trial 
At this point, 

study (1928 

He noted in reward 

periments that wher 


the 


pieces ol food under a < 


served experimen 


Ip, but later 
found only one piece, certain emo 


tional 


and searching en- 


responses 
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sued. This behavior did not occur 
when one small piece was substituted 
lor one large piece. 

Michels (1957) used a latin square 
design to assess the effects of amount 
of reinforcement (.5, 1, 2, or 4 pea- 
nuts) on latency of response to a 
The WGTA was 
used and the rhesus monkeys who 
served as Ss were on a 20% partial 
reinforcement schedule throughout. 
Latencies with the .5-peanut reward 

significantly greater than with 

three larger rewards, which did 

not differ reliably from each other. 
rhe 


, 
ne singie 


single test object. 


were 
the 
remaining studies utilizing 
problem method are all 
operant conditioning studies. Jenkins 
and Clayton (1949) allowed a group 
of pigeons either 2-sec. or 5-sec. ex- 
posure time to the food magazine in 
1 counterbalanced order during PR 
and found a faster rate of key-peck- 
ng with the longer duration. Roughly 
many consummiatory re- 
sponses were emitted during the 5- 
sec. exposure than during the 2-sec. 
exposure 

Foll 


ibed. 


twice as 


wing the training previously 
20 of Guttman’s (1953) 
rats were given further PR training, 
each S under each of the four sucrose 
trations used. 


1esc! 


A linear rela- 

p was found between rate of 

nding and the logarithm of con- 
centration, as had been found with in- 
dependent groups under PR. Ina 
Guttman (1954) com- 
pared the reward values of various 
sucrose and glucose concentrations, 
in equal logarithmic steps 
to 32°. For both sucrose 
the relationship be- 
rate of bar pressing under an 
aperiodic schedule, and log concen- 
linear. 


second study, 


varied 
from 2°, 
and glucose, 
tween 
tration, 


was Throughout, 


rate was higher with sucrose than 
with the corresponding glucose value. 
[wo latin squares were used such 
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that any S experienced all concentra- 
tions of one substance but not the 
other. 

Conrad and Sidman (1956 
tigated the effect of sucrose solution 
concentration on bar 
rhesus monkeys. Each S experienced 
each of seven concentrations, rang- 
ing from 0% to 60%. The maximum 
rate occurred between 15% and 
30%, with a decline at 60%. Ver- 
have (1956) assessed the effects of 
seven different sucrose 
tions presented to the same rats on 
response latency and rate for a two- 
member chain (pulling 
Concentrations varied from 2% to 
32%. Response rate increased and 
latency decreased as a function of 
concentration, both functions reach- 
ing their asymptote in the neighbor- 
hood of 20-21% concentration. 

Collier and Siskel (1959) and Col- 
lier (1958) have attempted an 
amination of the factors producing 
nonmonotonic 
which have been found with either 
the absolute or the _ differential 
method. Collier and Siskel 
sucrose concentration (4%, 8%, 16%, 
and 32%) and PR interval (.5-, 1-, 2- 
and 4-min.) factorially and assessed 
their effects on bar-pressing rate in 
rats. Both main effects, 
their interaction, 
The nature of the 
such that the nonmonotonicity of the 
obtained concentration function was 
itself a decreasing function of the 
inter-reinforcement interval. Collier 
(1958) discusses these and other data 
and concludes that a nonmonotonic 
relationship will most likely be found 
under any of the following condi- 
tions: an extended test-session, large 
volume (Hughes’ [1957] study veri- 
fies this assertion), high concentra- 
tion, or short time between reinforce- 
ments. 


inves- 


pressing by 


concentra- 


responses). 


ex- 


incentive functions, 


varied 


as well as 


nificant. 


interaction 


were silt 


was 
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Simultaneous Discrimination 

Festinger (1943) allowed rats 10- 
sec. exposure to whole wheat grain in 
one arm of a discrimination box and 
l-sec. exposure in the other arm. By 
the end of 96 trials, the rats were re- 
sponding on free-choice trials to the 
side with the greater exposure time 
at an above-chance level. Denny and 
King (1955) the study 
with .7-gm. reward in one goal box of 
a T maze and .1-gm. in the other. By 
the end of 84 training trials, a prei- 
erence for the larger reward side had 
developed Discrimination training 
was by reversal training, 
with the large- and small-reward 
sides interchanged. By the end of 72 
reversal trials the Ss were running to 

side with a frequency 
greater than chance. 

Pereboom (1957a) rats 
free- and two forced-choice trials per 
day in a T maze for either 10 or 21 
Five food pellets were found 


replicated 


followed 


the opposite 


gave two 


days 
on one side, one on the other. Follow- 
the large and small 
rewards had their sides interchanged 


ing this training, 


until a total of 35 days had been com- 
pleted. Asa third stage of the experi 
ment, for an additional 12 days the 
amounts were equalized on 
the end of the 
10 training days, all Ss were 
consistently to the large 
reward Reversal, as in the 
Denny and King study, was gradual, 
but complete by the end of reversal 


reward 
the 
initial 


two sides sy 
responding 


side. 


During the subsequent re 
the Ss 
tended toward chance performance 
The gradual reversal found by both 
Denny 


training. 


ward-equalization phase, 


and King, and Pereboom, is 
to be contrasted with the abrupt per- 
formance changes found when incen- 
tive magnitudes are altered in the 


runway. Pereboom interprets this 


inconsistency in terms of competing 
exploratory behavior which retards 
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reversal learning, and would be ex- 
pected to be greater in the more com- 
plex T maze than in the runway. 

Wike and Barrientos (1957) 
amount of material 


held 
and 
consummatory stimulation constant 


nutrient 
while allowing only duration of con- 
This 
the di- 
gy tube from 
the water incentive (for thirsty rats 
ned. With the smaller 


matory ac 


summatory activity to vary 


was achieved by vary ng 
ameter of a drink which 
was to be obta 
diameter more 
tivity 


stant 


was req iif 


amount of water Che two 

pitted against 

® i 

and the rats were required to learn a 

position discrimination on this basis 
By the the 27 

given, 85% of 

with the 
significantly 


ameters were 


other in opposite arms of 


| 


end of daily trials 


responses were to the 


side smaller diameter, a 


level above r h ince 
results nicely confirm an 
lier that 


neither amount of nutrient m 


These ae 


conclusior Variations in 
iterial 
tory stimulation are 


nor consumn 


Variation in consum- 


necessary, but 


matory activity is a sufficient mecha 


nism. 
Successive Discrimination 


D'Amato 


on a successive di 


1955) gave rats training 
crimination prob- 
lem ina such that on half 
the trials a larger incentive was 
ciated with a goal 


while on the other half a 


runway 
isso 
box of one color 

iller in- 
with a dif 
By the end 
the Ss were run- 
the 
smaller 


centive was associated 


ferently colored goal box 
of 70 training trials, 
faster 

the 


1) } 
e€ I loved 


significantly for 
larger reward than 


Greene (1953 


ning 
for 
i design 
discrimination 


utilizing successive 


a training-phase and simul- 


during 
ig a test- 
a factorial design it was 


taneous discrimination durit 
phase. By 


possible to assess learning vs. per- 
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During the train- 
ing-phase, rats found either a large 
pellet in a black goal box and a small 
pellet in a white one, or the reverse. 
During the test-phase, either a large 
or a small reward was found in the 
black arm of the apparatus, the 
white arm being empty. Number of 
errors and number of trials to crite- 
rion during the test-phase were sig- 
nificantly less for the animals having 
the reward associated with 
black during the training-phase, but 
performance did not vary systemati- 


formance effects. 


larger 


cally as a function of contemporary 
rhis result is flatly counter 
to test-phase results reported earlier 
using the absolute method. Here we 
have evidence for a persistent effect 
of training-phase amount on later 
which can be inter- 
preted to mean that, with the dif- 
ferential method, amount of incen- 
tive affects learning. This conclusion 
will be verified in subsequently dis- 


amount, 


periormance, 


cussed papers. 

Powell and Perkins (1957) manipu- 
lated duration of incentive exposure 
( successive discrimination 
and found it to influence 
discrimination _ test- 
confirming 


luring 
training 
simultaneous 
phase performance, 
Greene's results. 

stve Problems 


c 
u4CCE 


which Ss _ are 
trained on a number of consecutive 
with different incentive 
nagnitudes associated with different 
yield clear-cut 

incentive magnitude 


Experiments in 
problems, 


L 


evidence 


affects 


problems, 
that 
learning. 

rhe first, and probably the most 
significant, of these experiments was 
published by Meyer in 1951. Meyer 
trained eight rhesus monkeys, highly 
sophisticated on several tasks includ- 
ing discrimination reversal, on a 
series of 64 different discrimination 





108 B 


reversal problems in the WGTA. 
Each problem used a single pair of 
objects throughout an original learn- 
ing phase and four 
(ABABA). Response to the correct 
object yielded a constant amount of 
incentive for any given reversal but 
varying amounts over the four re- 
versals of that problem. Statistical 
analysis was of reversal errors as a 
function of the 
practice, or number of prior prob- 


reversals 


following factors: 
lems; reversal number within a prob- 
lem; prereversal reward (one or three 
pieces of raisin or peanut); postre- 
versal reward (one or three incentive 
units); prereversal criterion 
four successive 
and criterion (same 
The critical comparisons for the 
ent context are those involving pre- 


two or 
correct responses 
postreversal 


pres- 


and postreversal amounts, and stage 
of practice. 

Both pre- and postreversal amount 
affected reversal performan e, 
the first-order interactions 
pre- and postreversal 
between 
practice, 
means is 


and 
between 
and 
amount and 
What this 


prereversal 


amount, 
postreversal 
were significant 
that both 
amount (training-phase amount 
postreversal 
amount), 
formance 


and 
amount (test-phase 


iffect postrever | per- 


The learning of discrim- 
ruled 
that Ss entered this 


with a 


ination reversal per se may be 
out as a factor in 
experiment 
amount of reversal learning experi- 
ence, and over-all changes in per- 
formance with stage of practice were 
insignificant (total errors on the first 
half of the experiment were 1,402, on 
the second half, 1,404). So it would 
seem that we unambiguous 
evidence that incentive amount af- 
fects learning as well as performance 
independently of learning. The sig- 
nificant interactions imply that the 
effects of pre- and 
amounts are not independent of each 


( onsiderable 


h ive 


postreversal 
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other. However, this nonindepend- 
ence was apparent on early problems 
only. By the termination of the ex- 
the four treatment 
binations of and 
amounts were ordered in their effect, 


periment, com- 


pre postreversal 
most to least errors, as follows: 3-1, 
1-1, 3-3, and 1-3 
cates possible long-lasting contrast 
effects, 
that are slow to develop. The meas- 


ure Me 


criterion, 


Chis ordering indi- 
both positive and negative, 


yer en ployed was errors to 
a measure we have taken to 
Thus, 
the results of the experiment taken 


represent effects on learning. 


as a whole suggest the following con 
Incentive magnitude affects 
learning when Ss experience different 
but the Ss 
must first learn to discriminate mag 
nitudes Once magnitude discrim- 
ination is achieved, differential ef- 
fects on problem learning emerge. 
This receives support 
from a more recent study by Schrier 
and Harlow (1956). They used color 
discrimination problems to assess the 


clusion 


incentive magnitudes, 


conclusion 


effects of incentive amount (one, two, 
or four 


» 9 


gm. pellets), problem dif 
| practice, on a learning 


ficulty, an 
measure, correct re- 
in Java monkeys. All main 
effects were significant, as was the 


percentage of 
sponses 
practice by amount interaction. This 
reflects the finding that performance 
begins at about the same level for all 
amounts, but subsequently the learn- 
ing-set curves diverge, so that dif- 
ferent asymptotes are approached. 
“,..thata 
learning process is involved which in 


fluences the perception of, and in 


Schrier and Harlow say, 


turn, the response to, varied amounts 
of incentive, and that such learning is 
independent of discrimination learn- 
ing per se”’ \p 120). 

Meyer and Harlow (1952) investi 
gated the effects of incentive amount 
(1, 2, 3, or 4 units) on delayed re- 
sponse performance of rhesus mon- 





INCENTIVE MAGNITUDI 


keys, using the WGTA 
of errors decreased with increasing in 
and the 


with practice trom 


Per entage 


centive amounts, incentive 
function changed 
one with slight positive acceleration 
to negative acceleratior 

(1956 report 


Davis ted result 


two experiments on problem solvil 
He four 


ve a 


in monkeys 


crease in incent 
increase in balks i.€ 
centage of re sponses 
in 


tio! 


errors, on visual 
Phe 


on balks was apparent 


} problems 


tion effects with the 
In a d study 


problem, three raisir 


relate 


Trial 2 errors th 


no significal 
ber of balks 

Finally, Leary 
four conditions of reward 
fect on serial discrimination 
The fi 


tions, orthogonal to four 10 


in rhesus monkeys 


ina greco latin squi 
all pairs 
peanut for all pairs 

ilf for 
and two peanuts for two 


ire 


nuts tor ota 


five pairs, one-! the 
half peanut for the rema 
The two homogeneous 
tions led to periormans 


that with heterogeneou 


‘ 


difference attributable 


differences in errors on small reward 


pairs under the two conditions 


Paired-Comparisons Studies 
[wo studies, although employing 
ethod, 


the above cate- 


variations on the differential n 
do not fit into any of 
gories These are paired comparisons 
studies of food prefere: ce as a fun 

tion of magnitude. Harlow and 
Meyer (1952) gave naive 
rhesus monkeys 400 WGTA trials in 
which they were to choose between 
two peanut amounts: }, 4, 1, 2, or 4 


seven 


LE 
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This gave 10 pairs. Result- 
ing percentage choice scores were 
scale values by a 
method similar to Thurstone’s Case 


peanuts 


converted to 
III, resulting in an increasing linear 
relationship between scale value and 
log incentive amount. Fay, Miller, 
and Harlow (1953) had their mon- 
keys choose between a preferred qual- 
and a nonpre- 
ility (potato), when each of 
s of the preferred was paired 
ith each of two values of the non- 
d. They found that, as the 

io of preferred to nonpreferred 
ount increased (from 1/64 to 1/2), 
the choices of the preferred food in- 
the percentage 
choice depended, not upon absolute 
imounts compared, but upon their 


ity peanut or bread 
tert 
hve 


ed qu 


1 
Vaiut 


creased Further, 


parison of Absolute and Differen- 


- oe 2 
lethnods 


Several studies afford a more direct 
comparison of the absolute and dif- 
ferential methods than has been pos- 
sible 


that the two methods are compared 


in studies reviewed so far, in 


within a single experimental design. 
Logan, Beier, and Ellis (1955) com- 
| 


nare 
; ii i 


runway speeds during acquisi- 
tion when the incentive was nine 45- 

gm. pellets for one group of rats, 
five such pellets for a second group, 
ind, for 
ndom half of the trials and one on 
Thus, the first two 
groups were trained according to the 
ibsolute method, the third according 
to the differential. Over-all running 
60 daily trials) was greater 
for the first group than for either of 
the other Inspection of their 
Fig. 2 (p. 264) indicates similar rates 
of approach to terminal speeds in all 


a third group, nine pellets on 
aftfa 


the other half 


speed 


two. 


cases 

In a follow-up study, Logan, Beier, 
and Kincaid (1956) report the extinc- 
tion results of this and a related ex- 
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periment. For the first experiment, 
none of the three groups differed in 
their relative rates of extinction. In 
the related experiment, extinction 
followed 60 acquisition trials with 
nine pellets on each trial, nine vs. 
zero each on a random half, or nine 
vs. one. In this case the two varied- 
amount (differential method) groups 
were superior to the constant-amount 
group. The conclusion would seem 
to be that relative rate of extinction 
is retarded by previous comparison 
of incentive values. 

Lawson (1957) 
of incentive magnitude on discrim- 
ination learning by having rats learn 
two problems concurrently, a black- 
gray 
gray discrimination, gray being nega- 
tive in both instances. For some Ss 
the reward was the same for correct 
responses on both problems, either 


assessed the effects 


discrimination and a _ white- 


five pellets or one pellet (absolute 
groups). For other Ss the reward was 
five pellets on one problem and one on 
the other (differential groups). The 
two absolute groups did not differ 
significantly from each other in terms 
of errors over a constant number of 
trials. 
groups showed significantly fewer er- 
rors on the large reward discrimina- 


In contrast, the differential 


tion than on the small reward one. 

Two studies similar in design to 
Lawson’s are reported by Schrier 
(1956b, 1958). In both 


monkeys were divided into a 


studies, 
differen- 
tial and several absolute groups 
various phases of each experiment, 
differential Ss expe 
with 1, 2, 4, or 8 food pellets. I: 
trast, members of each of four abso- 
lute groups received experience with 
only one of the four magnitudes. In 
the first study (1956b 
measured on the latency 
to a single object covering a baited 
food well. By either method, per- 


received rience 


con- 


effects were 


of res] Onse 
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formance improved as_ incentive 
amount increased. However, the ab- 
solute and differential methods did 
not produce differences in over-all 
performance, and the slopes of the 
functions relating incentive magni- 
tude to latency did not differ signiti- 
cantly. In the second study (1958) 
effects on errors during a series of 
discrimination were as- 
sessed. Again, the two groups did 
not differ in over-all performance 
However, the slope of the incentive 


problems 


function was significantly greater for 
the differential method than for the 


absolute method. 


ACQUIRED REWARD VALUE AS A 
FUNCTION OF INCENTIVE 
MAGNITUDE 


Experimental studies of secondary 
reinforcement were recently reviewed 
by Myers (1958). On the basis of the 
evidence he (D'Amato, 
1955; Hopkins, 1955; Lawson, 1953 
Myers concluded that, “ . the ef- 
fect of amount of reward is so slight, 
that it can only be detected when the 
S is forced 


i reinforcers previously asso- 


disc ussed 


to choose between sec- 
ondary 
ciated with different sized rewards” 
(p. 294). The finding was that an ef- 
fect appeared only when training was 
by the differential method, but not 
the absolute 

Two studies have appeared since 
In the first of these 
compared the abso- 
il methods within 


Myers’ revicw 
Lawson (1957 
| 


different 


lute a 


a ] 
a Siligiec 


their effects on 


aesign, in 


strength By 


secondary reward 
neither method was a differential ef- 
fect of primary reward amount on 
secondary reward strength demon- 
strated. 

However, a study by Butter and 
Thomas (1958), using the absolute 
method, indicates that perhaps My- 


ers’ conclusion should be revised. 
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During training, click was paired 
with the presentation of either 8% 
or 16% sucrose solution, and during 
testing, bar pressing produced the 


clic k. 


nificantly more responses during the 


The 24% group emitted sig- 


test than did the 8% group, indicat- 
ing a differential effect of magnitude 
of primary reward on the strength of 
secondary reward. Butter 


Thomas suggest that the earlier fail- 


and 


ures with the absolute method might 
that the 
primary amounts were too large, to 
ward the asymptote ol 
reward amount function. 
Actually, it does not look 


be due to the possibility 


the primary 


cieare- con lusior S can be 


until more research has been 
pleted, 
similar to Lawson's (1957), allow 
direct comparison of the 
differential 


the safest stateme 


for example using a 


ibsolute 


Perhaps 


and methods 
made is the e1 

that amount 

attect 


and that this eff 


strength 


be evidenced if the differential 
ot incentive pr 
Finally, this 


cluded by mentior 


esentation is used 
section might be con 
1 of Wolfe's rigitr 

experiments on token-reward (1936 
Wolfe used the differe: thod 1 
which selectior token 
yielded two grapes, a white toke 


vielded one Krape, i d 


a brass tok 


yielded nothing. Three chimpanzees 


learned to consistently choose the 


blue token, demor 


| 


reward vaiues ¢ 


istrating tha 
ondary 
criminated, just as 


values can 


SUMMARY AND CoNn¢ 


LUSI 


soni‘ nt find 
st sig cant find 


Perhaps the mo 
ing of this review is 


the 
high degree of consistency of results. 
The facts 


relatively 


seem well.ordered, and 
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where contradictory results are 
found, such disagreements can usu- 
ally be fairly readily explained in 
terms of procedural differences. The 
empirical laws are now before the 
theorist, for him to explain in what- 
ever logically consistent way he can 
Because the findings seem so 
well ordered, he should perhaps have 
less trouble than in other aspects of 


behavior theory. 


de vise. 


We may conclude by bringing to- 
the various empirical gen- 
eralizations scattered throughout this 
and offering some suggestions 
or further research. The following 

neralizations have emerged from 

» review of the evidence on incen- 

magnitude: 

1. With the absolute methad, 
quantitative variation in incentives 
apparent effect on rate of 

A possible exception in- 
volves the manipulation of concen- 
tration of sugar solutions. A second 
exception is that learning is more 
rapid with any positive incentive 
imount than with zero amount. 

2. Asymptotic performance is an 


gether 


paper, 
| 
: 


h is no 


learning. 


ncreasing function of incentive mag- 
nitude. The function is negatively 
iccelerated, and possibly logarithmic, 
although under some circumstances, 
nonmonotonicity appears. 

3. These asymptotic differences re- 
flect a direct effect on the level of per- 
formance, rather than an indirect ef- 
fect based on differences in amount 
learned. 

4. Magnitude of reward affects re- 
extinction indirectly 
through differences in terminal level 
of rewarded performance. 

5. There exists conflicting evidence 
on the interaction of incentive mag- 
nitude with drive level. 

6. None of the mechanisms whereby 
results of quantitative 
variation are effected are necessary. 


sistance to 


behav ioral 
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7. Sufficient mechanisms seem to 
be amount of consummatory activity 
and stimulation from the incentive 
associated with consummatory ac- 
tivity. In the former case, it is the 
rate of consummatory activity which 
seems paramount. 
clusions is based upon exclusion from 
consideration of administration of 
nutritive substances other than 
ripherally.) 

8. With the differential method, 
incentive magnitude affects learning, 
but only after Ss have learned to dis- 
criminate amounts. 

9. The asymptotic incentive func- 


(This set of con- 


pe- 


tion is steeper when training is by the 
differential method than by the abso- 
lute method. 

10. Acquired reward value is an 
increasing function of magnitude of 
primary reward. 

To 
added a comment on the use of time- 
dependent measures as measures of 
performance and 
measures as measures of learning. In 


these conclusions might be 


time-independent 
the section on learning vs. perform- 
ance effects by the absolute method, 
it was concluded that incentive mag 
nitude affects performance hut 
learning. And it was typically time- 
dependent that 


not 


measures were af- 
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THE PARAMORPHIC REPRESENTATION OF 
CLINICAL JUDGMENT’ 
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The primary task of clinical diag- 
nosis is that of collecting, evaluating, 
and assimilating 
respect to the patient. 
point is the information 
may be in the 
test results, biographic 
on psychological 
symptoms, or other observables. The 
end result is a judgment; this may 
take the form of a recommendation 
concerning treatment 
a decision that certain other data are 
necessary before final 
made, or a classification of the pa- 
tient into a 
What intervenes between beginning 
and end is, for each cliniciar 


information with 
The starting 
itself; this 
laboratory 


al data, 


manifest 


form ol 
scores 


tests, 


or discharge, 
judgment is 
diagnostic « 


itegory. 


, a quite 


complex idiosyncratic process. It is 


the purpose ol this paper to demon- 
strate that the process 1s ¢ ipabl of 
rigorous investigation and 
tion. 


des rip- 


Pro ESS 


THE MENTAI 


In dealing with the 
which clinicians 


at their disposal to arri 


1 


utilize 


ments or decis 
that 
cerned primarily wit 


ons, it 
investigations wo 
nh ment il 
mental 


rr 
roc- 
pt 


esses; and since processes 


have often been equated to or pla ed 


within the realm of subjective ex- 
1 This investigation wa 
Public Health Service 
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The analysis of mt 
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ing Services, University of Washington, and 
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University of California, Los Angeles 
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ich of the data referred 


perience, it would be well to make one 
or two observations for purposes of 
In the first place, the 
term mental process is often directly 


clarification 


equated with subjective experience. 
But as private experience, the men- 
tal process is not observable. Hence, 
acceptance of this definition places 
the beyond the realm of 
legitimate scientific inquiry, except 
as it may be inferred from observ- 


process 


able phenomena such as verbal re 
And since no criterion can 
the validation of inferences 


sponses 
exist for 
concerning subjective experience, the 
inferences are simply ways of find- 
ing agreement in the use of language 
or other symbolic responses between 
If an 


inferences 


the subject and the observer. 


observer makes ‘ good"’ 
a subject, this means at 
that a exists be- 
tween them with respect to the sym- 


boli 


standing of 


concerming 
most consensus 


behavior involved. U nder- 
the mental process qua 
subjective 
beyond this level 

On the other hand, mental process 
It may 


physical (e.g., 


experience can never go 


may be alternately defined. 


be considered is a 


neurological, biochemical) event ca 
pable of direct observation, 1.e., using 


electrophysiological, neurophysio 


| 
and simil 


logical, ir techniques. To 
be sure, these techniques have so far 
yielded little that satisfactorily de- 
scribes cognitive mental functioning, 
and this is perhaps unfortunate. It 
does not follow that the approach is 
Improvement in the tech- 
niques of measurement and in the 
application of more explanatory mod- 


els may ultimately result in great 


sterile. 
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progress, even though this may seem 
remote at the present time 
ond definition is not unreasonable by 
any known 


This sec- 


standard, and it may 
surely encourage productive research 
and a resulting clarification of basic 
issues. But it is perhaps too early to 
say. 

This brings us to the third sense 
the 


may be employed. It should first be 


in which term mental 


process 
pointed out that any realm of sci- 


entific investigation is designed to 
provide, among other things, a use- 
ful level of descripti 
Direct 


mentation, 


objective yn 


observation, testing, instru 


tech- 
direction 


related 
this 
employed within a 
seek to 
describe relationships between events 
The proble m of de 
scribing judgment can 


and other 


niques are 


When properly 
theoretical framework 


ste ps in 
they 


or phenomena 
similarly be 
considered to be one which inter- 
poses a set ol te h ques al d 
oretical system between two 
observables. Thus it is possible 
“describe” the kinds of met 
tivity usually characterize 
tive by means of math 
One may thereby apy 
which is at 


models roacha 
least 


equal to that of other competitors in 


level of des nption 
some respects, and rtainly superior 
in other ways. That is to say, 1 
trolled the 


information) and the output 


cone 


situations wherein input 
(judg- 
ment) are known or capable of quan- 


tification, one may postulate func- 
betwee! 
and output their 


quacy by determining the 


tional relationships input 


and assess ade 
w?ccuracy 
with which each is capable ol pre- 
dicting judgment. The present paper 
is directed at this level of description 
The term mental 
simply to a functional relationship 


which 


process relers 


accounts for consistencies in 


] 


response to divergent stimulus (in- 


formation) patterns. It is thus a set 
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of intervening variables, nothing 
more. 

which immediately 
arises from the foregoing discussion 
is that of the adequacy with which 
it is possible to describe the mental 
underlying clinical judg- 
In answer it may be said that 
the process is adequately described 
when a particular mathematical 
model quite effectively predicts judg- 
ments for any given set of informa- 
with the 
meaning of the word “‘de- 
although 


A question 


processes 


ment. 


tion This is consistent 


scientil 
considerations 
such as simplicity, generality, and 
the testability of derivations must 
be kept in mind. A major problem 
understanding of judgment 
ill in this paper be considered to be 
t of formulating the kind of 

which is sufficiently predic- 


scription, 


in the 


tive, yet useful as a vehicle for ap- 
proaching other related problems in 


Different 
need to be dis- 
and evaluated, 


the area of! 


kinds of 


] 


cussed 


judgment. 
models 
compared, 
and empirical findings are of course 
necessary. Subsequent sections of 
will consider specific 
judgment. One, the 
model, is relatively simple; 
another to be described is somewhat 
more complex. 


his paper 
models ior 


linear 


Following this, some 
empirical findings will be offered as 
illustrative of the research oppor- 
tunities which unfold. 

beginning the discussion 
of specific models, however, it be- 
comes necessary to justify certain 
that must be imposed 
that meaningful inquiry 
made into the judgment 
The restrictions center in 
the nature of the information avail- 
able to the judge or clinician and 
which the judgment is con- 
he restrictions do not 
seriously impair the realism of the 
judgment situa'ion as long as one 


Before 


restrictions 
in order 
may be 


process. 


ipon 


tingent, 
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can bring some ingenuity to bear 
upon certain problems of quantifica- 
tion, but even this point of view 
may be objectionable to sorme. For 
completeness, therefore, and in order 
to provide early insight into the ex- 
perimental procedures usec in the 
empirical studies to be later de- 
scribed, attention may now be di- 
rected to the problem of the nature 
of the information available to the 
judge. 


THE INFORMATION 


The information upon which clini- 
cal assessment is based may com- 
monly be expected to include any- 
thing or everything, depending upon 
the training and inclination of the 
diagnostician, and depending upon 
what is available or obtain- 
able. The lack of control over such 
information may be 


asset or a liability, depending upon 


easily 
considered an 


one’s orientation, and the accuracy 
of judgment may or may not be en- 
hanced through the inclusion of non- 
quantitative data (Ullmann & Berk- 
man, 1959; Holt, 1958; Luft, 1950; 
Meehl, 1954) the 
judgment domain or situation. 
What seems certain, regardless of 


depending upon 


the outcomes of empirical research, 
is that the uncontrolled use of clinical 
data, whether or not it 
quantitative ' 


exists in 
makes clinical 
venture. It 
matter of 
whether 


What seems equally certain is 


form, 
assessment an 
must of 
one’s values as to 
wise, 
that any seriously conducted scien- 


artist 
course remain a 


this is 


tific study of judgment which has as 
its purpose the description of the 
method of combination used by the 
judge must take place in controlled 
settings, i.e., in such a way that the 
amount, kind, and nature of the in- 
formation available to the clinician 
or judge can be completely specified 
in objective terms. 
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Controlling the judgment task to 
this degree has its advantages and 
its liabilities. On the one hand, re- 
the situation as described 
assures that each person is evaluated 
with the informa- 
tion. Ambiguous and equivocal cues 
are and all judges are 
thereby certain to have at their dis- 
posal the same information and no 
more. 


stricting 


respect to same 


removed, 


Che inferences made beyond 
this point are thus certain to have 
their origins in the data provided. 
The major problem, that of describ- 
ing the idiosyncratic method of com- 
and 
formation by 


bination weighting of this in- 
the clinician, is there- 
by clearly defined. Clinical judg- 
ments are, of course, often made in 
settings wherein the kinds of infor- 
mation available may include inter- 
view and projective test impres- 
etc. In addition, the kind of 
information available may vary con- 


sions, 


siderably from one patient to the 
next. This may be said to pose a 


limitation to the situation described 


Such information may be important 


in judgments, a point perhaps best 
made by Holt (1958), but unstruc- 
tured clinical judgment 
nonetheless make the contribution 
of such information experimentally 
The problem 
here under investigation would, as a 


situations 


impossible to assess 


result, cease to be a scientific prob- 
lem altogether. 

Let us therefore consider that the 
situation in which a clinician makes 
evaluations of patients is restricted 
che in- 
available is reduced to a 


in the following ways: (a) 
formation 
set of variables with respect to which 
all patients in the sample are evalu- 
ated; 0 the ex- 
pressed in numbers or in categorical 
responses; and (¢ variable 
Satishes as a proper- 


information is 


each 

the 
his scheme 
leads to a set of numbers or classifi- 


minimum 
ties of an ordinal scale. 
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cations for each patient, where each procedures to the problems of judg- 
number or class represents the de- ment has been suggested by Bruns- 
gree to which a characteristic, trait, wik (1947), and by Hammond (1955). 


| 


symptom, or biographical factor is Todd (1954) reports a study using 
present. One example of the situa- regression coefficients and the multi- 
tion satisfying these restrictions is ple correlation coefficient for a de- 
that of a symptom check list; an- scription of the clinical judgment 
other is that of a test profile. Rating process, where the task was to judge 
scale data are likewis rmissible, as intelligence from a selected number 
would be combinations of tl of Rorschach signs. While such 
types studies provide interesting implica- 

Having objectified the d on tions, it should be stressed that there 
which the judgments are * are serious limitations with respect 
based, we may no' Irn ynsid to the interpretation of results; 
eration of the model limitations which may be minimized 
or overcome only through a detailed 
examination of the rationale under- 
Ihe linear model is o n which lying the model, and through re- 


‘ 


judgments a scribed ' nle formulations or revisions of the 
weighted m of the values of the model, should this be necessary. So 
information availab! ra mn as to insure the appropriateness of 
the linear model as a device for 

characterizing the judgment process, 

we consider in detail some of its 
properties, and provide the particu- 
lar reformulations where necessary. 
In the first place, and by virtue of 
the experimental control employed 
the collection of the data, the 
source of reliable judgment 
riance is from the information 
supplied. This is in objective form, 
e.g., it appears as a number, a desig- 


nated category, a position along a 


continuum, et Often these data ap- 
| ir as test scores on a set ol proto- 


cols being judged. Assuming that a 
j ombined the information in 

“ar additive fashion, the multiple 

ssion analysis will be quite ef- 

as a tool for describing the 

idgment process; 1.€., the set of 

regression weights when applied to 

gression Ol whe he the corresponding predictors can 
weights to pplied to the i quite properly serve as a model for 
pendent variables are hosen as judgment Thus, the adequacy of 
minimize the error in mating the linear model can be assessed by 
actual depends nt variable fron ne nspection of the magnitude of mul- 
weighted composite tiple R. If the judge integrates data 
Application of multiple reg: in additive fashion as opposed to 
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configurational or pattern analysis, 
the linear multiple correlation will 
approach unity when corrected for 
attenuation. Lesser values of R sug- 
gest progressively lesser utility for 
the linear model. 

Secondly, it may be noted that 
the regression weights signify, with 
certain limitations, the emphasis or 
importance attached to each of the 
predictor variables by the judge. 
Large coefficients mean, empirically, 
that the corresponding predictors can 
account for large proportions of the 
variance of judgment; and a pre- 
dictor with a small beta coeffiicent 
contributes little beyond the 
tribution of other predictors. In 
practice, characterization of the 
judgment process by means of beta 
coefhcients three limitations: 
(a) since Js differ with respect to the 
size of their multiple R, direct com- 
parisons of sets of beta coefficients 
between Js is L 
beta coefficients do not account for 
all the predictable variance; and (¢ 
beta coefficients do not allow for the 
assessment of the independent con- 
tribution of each predictor. What 
would be more appropriate would be 
a set of weights which are compar- 
able from one J to the next, which 
are capable theoretically of account- 
ing for all of the predictable variance, 
and which carry exact interpreta- 
tion in terms of components of vari- 
ance. 


con- 


has 


not 


meaningful; 


RELATIVE WEIGHTS 


The formulation that is required 


is fortunately not difficult. Beta 
weights (8,;) can be converted into 
a set of relative weights (w,,;) which 
have all the advantages described 
We show first that the variance of 
predicted scores (which in this paper 
refers to predicted judgments) can 
be partitioned into two sources; one 
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a sum of squared beta coefficients, 
and the other a residual of weighted 
covariances 
Let 
x’,=the predicted score for an in 
dividual (protocol) in reduced 
standard form. 
the standard score of the ith 
predictor (on the protocol). 
the beta coefficient for the ith 
predic tor. 


1 N ‘ “2 
: >| > Bac. | 
\ 1 

rm in parenthesis is a 
variance-covariance ma- 


in be divided as follows 


— 
k k-1 

cit DY 8.8 x0(i>j 
é—1 jwl 


rst quantity on the right, 
when squared and averaged over in- 
dividuals, yields the squares of the 


8 coett ents Thus 


k 


_ 8. 77 ,, 


1 


since the x; are standard scores 
Similarly, the quantity is 
a weighted sum of the intercorrela- 


tions 


be a ond 


imong the predictors. Thus: 
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N 8 bkIl 


DL 2D Boborers 


tol jel 


oV- 


_$ 


wl » 


> 


bof 


Therefore, 


A 


& kl! 
Le Brit Do 2 BoBetain 


a | tol pel 


Oo", = 


It follows that the variance of pre- 
dicted scores is described by a simple 
sum of squared beta coefficients if 
and only if the 
vanish. One special case in which 
this will be true is that of orthogonal 
predictors 

Relative we 
follows 


covariance terms 


is defined 


note that 


ight, w 
First e€ 


V Pola taro 


g/ X Barw=R 


1 


Squaring both sides and dividing by 
R?. we get 


BoP oi 


R%o.19 


=] 


Therefore; in interpreting by inde- 


pendent components of variance, we 


express relative weight as 


where 
8..=the beta coefficient for 
the ith predictor 
the validity 
(correlation with judg- 
ment) of the ith 
dictor 


coethcient 


pre- 
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R*o1,2..42=the squared multiple 
correlation coefficient 
reflecting the best lin- 
ear combination of the 
k predictors in predic- 
tion of judgments. 

Finally, a description of the judg- 
ment process by means of linear re- 
gression procedures and _ relative 
weights allows one to go on to studies 
of varied sorts. Judges may be com- 
pared and contrasted with respect 
to their characteristic equations; and 
differences among judges may be re- 
lated to training, personality, and 
other factors that could conceivably 
affect the utilization of data. Many 
other problems immediately suggest 
themselves. 

The linear model may effectively 
be able to predict (or describe) clini- 
cal judgments to a very considerable 
degree, but there may be other situa- 
tions for which linear models are 
just as there must 
be many judges for whom more com- 
plex models are necessary. Let us 
now turn our attention to a second 
type of model. 


not appropriate 


CONFIGURATIONAL MODELS 
In very general terms, the config- 
urational model can be described as 


J =f | Xi, Xo, Xs, dies X;) 


wherein the exact functional rela- 
tionship involving the k independent 
variables may be described in any of 
a number of ways. As an example, let 
us consider a particular type of func- 
tion, one which shall be referred to 
as an interaction model, The interac- 
tion model describes judgment as 
an appropriately weighted composite 
of all possible first order interactions 
of the predictors. Thus we may write 


k e1 
J=Acet > > AuXiX; 


tel fel 


(i>j) 
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The inclusion of interaction terms 
in a model takes account of the pos- 
sibility that for a particular judge the 
interpretation of one item of informa- 
tion may be contingent upon a sec- 
ond. By extension, other interaction 
Chus it 
is possible to include terms involving 
higher order 
postulated 

the 
models 


models suggest themselves. 
interactions or other 
functional relationships 
information 

represent 


among variables. 
Such 


judgments in a quite proper sense. As 


configural 


the postulated il terrelationships be- 
come more complex, the judgment 
becomes less dependent any 


information and 


upon 
single category of 
less dependent upon a 

weighted sum. Instead, the 
ment comes to depend upon the 


simple 
judg- 
con- 
figural properties of the profile, and 
these may approach a high degree of 
uniqueness 


Correspondingly, the 


person judged is increasingly evalu 


ated not so much with resp 
relerence group ol others eart 
Same score, but rather w 
to the 

parenthetically, a 
additive 


pattern 


model, some 


be judged as functionall) 
though the scale scores dif 
edly. 

Other configurationa 
be postulated, some ol 
involve interaction a 
but 
quire a transformation of one or more 
of them into a different set 
The 
formation 
tion that 


that linear functions best describe re- 


pendent variables, 


ipparent need for 


arises trom the assump- 


clinicians seldom believe 


lationships between information and 
It may 
in fact, be closer to the truth that, at 


characteristics being judged 
least for some classes of inforn 
extreme 
judgment than are scores in the mid- 
dle range. And will of 


tion 
ition, 


scores are more decisive in 


clinicians 
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course be expected to differ amongst 
themselves. For some, scores above 
or below some arbitrary value may 
carry no added significance whatso- 
ever In selecting configurational 
study, it therefore be- 


comes necessary to take into account 


models for 


the great individual differences which 


may exist among clinicians and to 
construct that type of model which 
The appli 


cation of such a model to a well-con 


trolled 


appears most promising 


situation may ultimately be 


caps ble 


~ accounting for all but a 


trivial fraction of the variance of 


judgments 


SUPPRESSOR EFFECTS 


of the difficulties inherent in 
beta 
when it is 


erpretation ol coeth 
ipparent 
some statement con 


relative 
hy 
: 


causality or 


\ beta 


lictor with which it 


ed correlates highly 
| 


rion and 


may be hig 


be iuse tf pre 


with 


assox 
is relatively inde 
But it 


even though the 


ot other pre dr tors 


be high 


with which it is associated 


very poor predictor of the 


is its correlation 


so lor y 
valid predictors is suf 


gh The 


Statistics 


point has been 


textbooks 
ssed in a few recent 

lications (e.g., Lubin 
he example presented in 
ir (1955) will serve as illustra- 


me the following 
400 
OO 
= .707 


where is a judgment criterion, 


an ire predictors (informa 
tion). 

Solution of this matrix leads to 
Ro 12 = .566. 


SOU, Db 2 566, 
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Quite sntly, the second pre- relation, and with reference to the 
extreme case. If the validity coef- 
is attributable exclusively 


dictor 18 a suppressor 
negative weight ause it ; in noient r 
for variance in to a second predictor, 
hat is independ 
Beta coethcient 
quately descri 
ess for this « 
566 even if 
unay uilabl 
judge! Indeed 
having 
ocols to 
il pre ict 
ix in 
me of these n 
cant betas 
[he use ol 
ites this difhe 


1c ribed, 


uld appear therefore that 
veights as defined do in fact 
meaningful descriptions of 
f judgment, 1.e., with 
relative importance 

is items of information 
the judge, and without 

i spurious efiects olten 
with multiple regression 
Further, it would seem 

eral mathematical proof 
ision is not difficult to 
These considerations apply 


near and configurational 
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em becomes ap- 

is recognized that 
models may be capable 
for judgment variance 
ficiency. Consider, for 
given model which is 
te in predicting judg- 
the information given. 
we may be said to have 
“described” the 


process, but one im- 
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portant qualification is necessary. 
Even in the hypothetical situation 
in which prediction is perfect, one 
cannot conclude that the mental 
process has been ‘‘discovered.”” By 
definition, of course, this point should 
be obvious, but it is well to point 
out that even among sets of mathe- 
matical relationships (models) which 
are ostensibly different, there may 
be some which are in fact equivalent 
with respect to explanatory power. 
An example may serve to clarify 
this point. Let us assume that for a 
given judge, and for two informa- 
tion variables, X and Y, the judg- 
ments can be independently pre- 
dicted from X and Y with 95% ac- 
curacy by the following equation: 
jJ’= 


PV X*+ ¥2+2XY 


We note that the right hand term is 
simply the square root of the bi- 
nomial (X+ Y)?*. Since X¥+Y 
=+v X?+ Y?+2X J, it follows that 
the equation J’=X-+ Y will account 
for the judgments equally as well as 
the expression 


J’ a 4+V X24 ¥242X) 


It is therefore no more reasonable to 
conclude that the judge is in fact 
“using’’ one particular combination 
of the information than it is to con- 
clude that he is using the other. One 
would have to establish different cri- 
teria two 
such representations may be intelli- 
gently made. 

But this should not be a trouble- 
some point. Mathematical 
are designed to provide a scheme 
whereby one set of events may be 
satisfactorily predicted from an- 
other, and whereby testable deriva- 
tions may lead to more complete 
theoretical understanding of the phe- 
nomena. Such models therefore con- 


before a choice between 


models 
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stitute a level of description and ex- 
planation which suffices for scientific 
not required of 
models that they bear any semblance 
“actual’’ state of affairs, 
either within the organism or else- 
where, nor would this 
lead to a better understanding of 
nature 

This: may be more clearly seen in 
relation to an example from the phys- 
A chemist has the task 
He per- 


of operations (or 


purposes It is 
of some 


necessarily 


ical sciences 
of describing a substance. 
number 
the and deter- 
its chemical composition. It 


forms a 
tests) on substance 
mines 
turns out to be a relatively simple 


task The 


chemically, as CaCO, 


is described, 
But the work 
omplete, for a different set of 


substance 


is not 
tests) might have pro 


level of descrip 


operations (or 
duced a different 
which not be 
the but 
more suitable for other contexts 


tion; one may neces 
which is 


The 


mineralogist, for example, would per- 


sary for chemist, 


form a series of tests to measure the 
optical properties of the crystal, or 
he might examine its hardness, solu 
bility, 
crystals identified by the chemist as 
fact 


one 


and other characteristics. Two 


be somewhat 
the 
aragonite, 


CaCO; might in 


different from another to 


being 


mineralogist, 
the 


one 
These two 
have the same 
but they differ 


in molecular structure; this being re 


calcite. 
fact 


structure, 


othe I 


crystals do in 


and 


chemical 


vealed by opti al and other tests. It 


that different levels of 
possible with regard 
to substances, and that each has its 


peculiar 


iS app irent 


description are 


advantages and shortcom- 
ings. 

In mineralogy, calcite is commonly 
as a paramorph of arago- 
This word is used generally to 


described 
nite 
describe a substance having crystal- 
line structural properties which dif- 
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fer from those of another substance 
with the identical chemical composi 
tion.” 

We have borrowed the term para 
morphic from mineralogy and em 
ploy it in relation to representations 
of human judgment The 
may not be cor 
tions are nm 
matical 
ment process 1s 
that approaches 
scription ol 
helps to icco 


what is observe: 
, \ sample profile for the judgment of 
tain propert 

intelligence 
the judge, just 
formula ‘“‘explai 


selected. Two of these made 


iot all roperties , as . on 
, to ments of “intelligence” of 100 


rs cit substa: persons on the basis of a set of nine 
ys predictors or sources of information. 
he remaining two made judgments 
{ ‘sociability’ of 150 persons on the 


basis of profiles containing scores on 
( hemi al anatys 


sO , eight selected Edwards Personal 


description ol 


Preference Schedule (EPPS) vari- 


] 
incomplete, tor , 
ables In all cases, the judges re- 


erties of judgme . 
- irned after an intervening period of 
KI ' 


il days and made a second set 


or how accurately - 
ments on the same profiles. 
process has beer . a 
es profiles are shown in Fig. 1 
tern paramorphic rept tat ! : . ) 
used in relation t rmet woul a = lee 
ar Application of standard multiple 
idequately indicat th 
regression procedures reveals that a 
best linear combination of the pre- 
Tue Linear Mope t: Repri lictor scores correlates .948 and .829 
SENTATIVE RESULTS with the judgments of Judges 15 
d 18, respectively.? But to what 
extent can it be said that the linear 
model adequately characterizes the 
idgment process for these judges? 
Che reliability of judgment for these 
two is .876 and .836 respectively. 
Correction for attenuation results in 


ot all 


Alls 


As illustrative of 
of the methodology 
four persons wl 
judges in studies 


2 More 


* Corrected for shrinkage Multiple Rs 
‘ computed from a cross-validation sample of 
calcite that results from alteratio ite protocols were .937 and .837 for these /s re- 
which is paramorphic spectively 
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tion achieved by the multiple regres 
Oeference sior model? 
There are some difficulties in at 
g to ascertain, from the state 
its of the judge, a subjective im 
a cognitive process In 
Seige | | star in which persons are asked 
Change ; make judgments of intelligen¢ 
abinehancettis 1 al ility from the sets of 
that have just previous! 
ribed, it is rarely true tha 
i high degree of co 
ments he may make 
» relative importance « 
V iriables Ir SOE 
ilt 


mn 


‘ es ol communicatsi 


hen, for example, a judge 
coefficients are ; } d necessary to relate some 
.907. Thus it m > seen, j 1€ mplex or configurationa 
case of the t yl tha lines 1% iat he feels best describe 
additive combinatior t ! O71 “method of combinati 
(once a be I I ion t iltern ve is to isk the judge t ) 
weights is know: illov he predi listribute 100 points among the 
tion of judgmen itl rtua om irces formation available ar 
plete certainty residual or in such y that this distributio: 
Variance is trivi in comparisor1 “fle t | vest of his knowleds 
with that which redictable from _ the re! importance of those v 
the model | 
case is some 
unreliability « 
into 
17.7' 
counted for by 
therefore, 
while qu 
judge, is 
second. 

A second type 
asked with respect hese ju I juately describable 
Is a judge able to describe hany weighted im of the informatio 
ascertainable degree of ili \ withstanding such attitude 
manner in which he utilizes is , subjects used as judges 
tion in arriving at his judg: ot stud pt thy nderw 
its present form this question is prob nonly report satisfaction 
ably unanswerable, but it is quite fidence the procedure, and with 
proper to ask a related one Con- it ppare! relationship to their 
cerning the method of utilization of attitud neerning the complexit 
information, what correspondence ex- of the me combination they 
ists between the verbal description believe themselves to be using 


offered by the judge and the descrip- The number points assigned in 
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iMORPHI( 





this way 


V iriables 


he fe 


MOjeCcIive We 


show! 


a 
noun 
l hese 


whic 


infort 
to ipprox 


is determi 


Results fron 
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izes 


sub ective 
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omparison of relative and subjec- 
in the judgment of intelligence 


The two examples from the socia- 
lity experiment can be used to 

istrate the same phenomena. The 
relevant information is described in 
lable 1 and in Figs. 5 and 6. 


\ CONFIGURATIONAL MODEL: 

REPRESENTATIVE RESULTS 
Data for one type of configura- 
| model comes from a study by 
1957). On the basis of 
and rather complete inter- 
ews, Martin was able to obtain 
ly clear statements from a set of 
ounselling psychologists. These 
ents expressed the manner in 
which the psychologists believed 
they were utilizing the set of eight 
PPS variables in the prediction of 
sociability. Of the five judges, Psy- 
t D has been selected for 


INTELLIGENCE & SOCIABILITY 


Atte: ated ~—— 
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RELATIVE 
™~ 


WEIGHT 








Fic 
tive weights in the judgment of sociability 
(Judge 3). 


5. Comparison of relative and subjec 


His verbaliza- 
tions with respect to the judgment of 
sociability follow 


illustrative purposes. 


As might be expected I generally look for 
some over-all patterning of the test variables 
Although I do hold in mind certain standards 
and/or tendencies. In 
sion when I refer to high I mean at least 1 SD 
above the mean. For a rating of high Soci- 
ability, i.e., 7, 8, or 9, I would generally expect 
at least two or three Exh 
+ Het.) to meet the criterion of 1SD above 
the mean with others at least in the middle 
range or pointing in the direction of high. I 
would also expect for this rating that Aba 
in the average range or in the direction of 
The more that Exh + Het 
the high extreme and Aba. the low extreme the 
more apt I am to make a higher rating. In 
this concept the other four scales act in pairs 
For example, if the above conditions are met 
and Def. and Suc. are not in either extreme the 
rating remains unaffected. If both are ex- 
tremely low they tend to add to the rating and 
if both are extremely high they tend to detract 
from the rating of high Sociability 

The scales Chg. and Aff. are also considered 
as a pair but add or detract nothing to the 
ratings unless both are quite high or quite 
low. If both should be extremely low and the 
other conditions for a high rating are met | 
would suspect the reliability of the test 

In the case of a low rating (1, 2, 3) I would 
generally expect a somewhat opposite pattern- 
ing. For example, here I would expect Aba 
to be quite high with pair Suc. and Def. in the 
middle range or pointing in the direction of 
high. Similarly, I would expect at least two of 
the three scores (Dom., Exh., and Het.) to be 
in the middle or low range with none of them 


the following discus- 


scales + Dom 


be 
low 


+ Dom approach 
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The higher Aba. along with 
and Suc. and the lower the 
Het., and Exh., the lower the 
and Aff. again have 
unless they are significantly low 


extremely high 
the pair Def 
scales Dom., 
rating. The pair Chg 
little effect 
and then they tend to support or add to a 
low rating. 

In rating within the average range I look 
for the significant scales Exh., Dom., Het., 
and Aba., not be extreme in either direction 
While the variables are considered in relation 
another 


to one a high score on Exh. offsets a 


high score on Aba.) they contribute to my final 


rating whether singly or in pairs 


the 
it may be argued. that the set of vari- 
ables underlying Ds judgments must 
take into account the following con- 


From foregoing description, 


siderations: 


1. Interaction. Certain of Ds statements 
imply an interactive or multiplicative re 
lationship between two predictors and 
the judgment criterion. A good example 
of this is the statement, ‘The scales Chg 
and Aff. are also considered as a pair but 
add or detract nothing to the rating un 
less both are quite high or quite low." 

Nonlin Since D has stated that 


variables are of most importance when 


aruly 


their values are high, and that values 


between tle of the mean are usually 


wed, an exponential function should 


ig 
be more predictive of judgment than a 
linear 
As a result, the following predictors are 
defined 
X, Ab 
Xs | 


X, Heterosexuality 


one 


isement 


xhibitionism 








~ 











Fic. 6 
jective weights in the judgment of sociability 
(Judge 6). 


Comparison of relative and sub 
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X, Dominance 
X, Change X Affiliation 
X, Succorance X Deference 





Each of the six predictors can then 
be subjected to a nonlinear trans- 
formation, the result of which is to 
correct for the stated tendency of 
the judge to discount predictor scores 
near the mean of the distribution, 
and to emphasize them increasingly 
as they become extreme, up to 
limit Ihe resulting variables 
then weighted according t 


Def 


squares proc edures 


As in the case of the linear model, ». 7. Comparison of relative and subjec- 


weights, using a nonlinear model 


t is 0% e npute lat > . ~ 
tm is f ssibl to compu relative ian D (From Martin, 1957). 


weights and to compare these with 
the weights assigned subjectively 
his comparison is shown in Fig nee to the others mentioned. This 
We shall discuss the data short], neerns the stability of the judg- 
What classes of problems can be ment process. Can a “linear’’ judge 
attacked through the technique of » taught to be more complex? Can 
the configurational model? Asinthe any judge be taught to make more 
case of the linear model, it furnishes ient and more accurate use of in- 
a description of the relative im ormation? Will this generalize or 
portance (to the judge) of the vari transfer to other judgment situa- 
ous sources of information available ion What are the personality 
But greater latitude is possible. Con- characteristics which differentiate the 
figurational models are capable of flexible judge from the unchangeable 
handling the complexities and pat- one? Some of these problems are 
terns believed by many to be an es currently under investigation and 
sential (if not natural’) part of the results will appear in forthcoming 
judgment process. Thus, one may publications 
ask whether a graduate course in Finally, it may already have be- 
psychodiagnostics or in personality come apparent that the issue of 
assessment was effective in produ nomothetic vs. idiographic approaches, 
ing students who are ‘configura first proposed by Windelband (1904) 
tional” in their interpretation of and discussed more thoroughly else- 
case material where (Allport, 1937; Sarbin, 1944) 
\ second class of problem is that may be approached in one of its 
of individual differences. Do persons major aspects through the use of 
differ in the type of model which judgment models. If the argument 
most appropriately accounts for their is confined to that of “method of 
judgments? If so, in what respects? combination,” as suggested by Meehl 
And what proportion of these dif 1954), representational models are 
ferences can be attributed to specihc capable of providing some strik- 
training? To personality? To intel ingly clear evidence. Martin (1957) 
lectual characteristics? presents some interesting findings 
There is a third class of problem, in this respect, and it may be well 
one which is at least of equal signifi- for illustrative purposes to return to 
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Psychologist D, discussed above, 
and ask what the data suggest 

The first thing that might be said 
with respect to this psychologist is 
that, after having defined the pre 
dictors to his liking, he is 
astute as he might be in attaching 
subjective weights to them. By this 
it is meant that the dis« repancies be- 


not as 


tween subjective and relative weights 
are by no means insignificant in an 
The 
made even clearer by 
other way 


absolute sense matter can be 


Stating it an- 


Given the variables in 
question, a computing machine would 
come closer to reproducing Ds judg- 
than he could come 
were he to do the weighti gy as he 
says it should be done. What 
of particular importance is the 
that both of the 
bles were overevaluated in this re- 
Perhaps one likes to think of 
himself as being more complex than 
he actually is. 


ments himself, 
may be 
tact 
interaction varia- 


spect. 


Since the computer does as well as 
it does in reproducing the judgments, 
an example is here provided wherein 
a single consistent set of rules of 
combination 
successfully applied the many 
and 
judgments may 


(nomotheti may be 
varied cases resulting 
well differ from 
“configural”’ or “‘clinical’’ judgn 

by amounts which are quite trivial. 
It is not impossible that this phe- 
nomenon will OCCUF @ven in 1 
wherein the judge is schooled in the 
idiographi 
himself to be 
ingly. But 
the earlier ones are perh ips re kless 
at this point, particularly in that the 
error variance for relative and sub- 
jective weights is not presented for 
these data. The purpose of the paper 
is more to illustrate the applic ation 
of a methodology than to argue for 
reforms in clinical psychology from 


evidence based on a sample of Size 1. 


istances 


tradition and _ believes 


functioning accord- 


this generalization and 
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What, additionally, may be said 
of the model de 
veloped for D? We may legitimately 
ask to extent the substitution 
of complex variables in the place of 


( onfigurational 
what 


simple ones effectively enhanced the 
thereby 
relative efficacy of two 
The 2 (corrected for shrink- 


age), using the configurational model, 


prediction of judgments, 
testing the 


models 


is .88. This appears high, and it may 
well be, except lor the fact that the 
model to D's 


Dp 


corrected KR 


application of a linear 
judgments results in a 
of .91 

urational 


Thus, substituting a config 

model for a linear one 
changes the proportion of predicted 
variance of judgments from 82.81% 
to 77.44%: a little better 


! And considering chance 


loss of a 
than 5% 
factors, the least that can be said is 
that there is 


over! 


no demonstrable gain 


the linear model 


SUMMARY 
This 
the 
tion is utilized in decision making or 
itions It 


been concerned 


which 


paper has 
with 


manner in informa 


in judgment sitt is shown 
that 


way ol 


mathematical models provide a 


describing mental processes 
which would otherwise be accessible 


only through introspection or electro 


physiological techniques \ linear 
model and 
are des« ribe a 


ished for 


possible the 


a configurational model 


j 


nd illustrations furn 


each. Such models make 


testing of hy potheses 


met 


diffe 


concerning combination, 


iod of 


individua rences in judgment 
ability, ‘cts of training, 


ality 


per Son 


correlates, idiographi inter- 
materials, et 
that the para- 


representation of judgment 


pretation olf case 
It may be 
morphi 


said 


to otter 
bilities for 


appears interesting possi 
and theory as 
well as for applied endeavors. Models 


which attempt to describe the method 


research 


of combination of information in de 
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ion which would otherwise be 


difficult. And few would dis- 
with the suggestion that sound 
n of the decision process 
fundamental to a complete 


nding of man. 
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SCALE FOR CHILDREN: 
RESEARCH 
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In the 10 years since its publica- 
tion the Wechsler Intelligence Scale 
for Children (WISC Wechsler, 
1949) has found icceptance 
among psychologists working 
children in schools, clinics, 
pitals. With the 
WISC, not only as a 
telligence but often as a clinical diag- 


wide 
with 
and hos- 
the 
measure of in 


wide use of 


nostic instrument, it seems advisable 
to take a careful look at the growing 
fund of literature « the 
WISC, its validity in its various uses, 
and its general characteristics as a 
measuring device 


oncerning 


THe WISC 

The WISC was developed as a 
downward extension of the Wechsler- 
Bellevue Intelligence Scales W-B 
(Wechsler, 1944 most of the 
items contained in the WISC are 
from Form II of the adult 
(Wechsler, 1949; Seashore, Wesman 
& Doppolt, 1950 Easier items have 
been added to the low end of the sub- 
tests to make it suitable for use with 
young children 

Standardization 


and 


scales 


The WIS¢ 


con- 


sists of 12 subtests gro iped 
Verbal Scale (Information, Cor 
hension, Arithmetic, Similarities, Vo- 


nto a 
ipre- 
cabulary, and Digit Span), and a 
Performance Scale (Picture Comple- 
tion, Picture Arrangement, Block 
Design, Object Assembly, Coding, 
and Mazes). In the standardization 
of the WISC all 12 subtests were ad- 


ministered; only 10, however, were 


1 The author wishes to express his apprecia- 
tion to Robert Allen Keith for his many sug- 
gestions and comments through the several 
drafts of this paper 
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used to establish the IQ tables. Digit 
Span an 


marily be« 


1 Mazes were omitted “pri 


iuse of] their relatively 
low correlation with the other [sub] 


tests of the Scale and also, in the case 


of Mazes, the ti Wechsler, 
1949, p. 6). Wechsler suggests that 
all 12 


Lt 


poss pie 


me factor” 


subtests be given whenever 
“because of the qualitative 
they add” 


When 11 or 


prorating 1s 


and liagnostic data 
Wechsler, 1949, p. 6 
12 subtests are used, 
necessary 


The WIS was 


2200 white 


standardized on 
American boys and girls 
the 


respect to rural- 


chose 1 to be 
1940 census 


urbar 


representative ol 
with 
residence, father’s oc« upation, 
irea. Some adjust 
» allow for the ‘re 
tion to the West 
d 100 girls were 


} age levels. 


cnosen 
5 throu 
mentally ] ncent 
in the sample, all children were within 
1 half months of their mid 


one ang one 
| he met 


ayes 
Except for the 55 


children included 


years 


was dr iwn 


tally deficient group 
primarily Irom institu 
Michigan, and New 


1 more lenient age standards 


tions in Illinois, 
York 
were observed rhe standardization 
field 


ers who worked in &5 different 


tests were administered by 17 
exami! 
communities 

The WISC 10's (Verbal, Per 
formance, and Full Scale 


based on 


are devia 
scores from 
The 
from subtests 
oS aled Scores by 
separate tables for each four-month 

(e.g., 5-0 to 5-3) and then 


tion norms 


other children of the same age 
raw 


scores 


obt iined 


are transmuted int 
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into 1Q’s with a mean of 100 and a 
standard deviation of 15 
WI S¢ karly 


varied from 


Early reviews of the 
the WIS 
somewhat qualifie | 
Delp, 1953b; McCandless, 
and the pred tion of wide 
Shaffer, 1949), to a 
rejection of the WIS¢ in favor of the 
S-B (Anderson, 1953 All of the re 


views were favorably impressed b 


reviews ol 


acceptam 


rather criti 


the care taker l ta 
Several specific « iticist 
tioned, however: the WIS¢ 
lacks any evidacence tor 
validity (Delp, 1953b; M 
1953; Shaffer, 1949 
temptation to d 

analy CS on 

1953) without 
tion for any in 
1953b; McCand 
1949): it 
tremely 


does not 
high above 155 
tremely low (below 45 
1953b and the su 


be too difficult for 


score 


btests 


dren (Dx Ip, 1953b he men il age 


was missed by Anderson (1953), Del; 
(1953b), and Shaffer (1949), an 
lack of Negro children in the 
ardization group was ment 
weakness by McCandless 
1953b) felt further 
1 verbal 
ible subject 


: , 
derson’'s rather critical revi 


Delp 
scoring olf certain 


( luded consider 


mentioned the fact that raw 


zero are given iled SCOT 

zero for the younger childre 

fact, ible to fi sever 
ipparent discrepancies Wecl 


t of the st ndar 


son Was, in 


statistical treatmen 
data 

Strong points brought out by 
reviewers of the WIS¢ 
to-date constructior Delp, 
McCandless, 1953) and its sta 
ization (Anderson, 1953; Delp, 
McCandless, 1953; Shaffer, 


ization 


were 


1953b 


1949 
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Mentioned also as strong points were 
the facts that all of the children are 
administered comparable batteries 
McCandless, 1953); the time of ad- 
appears to be shorter 
Delp, 1953b) and more predictable 
McCandless, 1953) than comparable 


tests; if 


ministration 


is easy to administer, in- 
to children, gives both a 
Verbal and Performance IQ, provides 


teresting 


[Q's directly comparable for various 

es, appears to have potential clini- 
and has an easily used man- 
u Delp, 1953b 


1 framework for 


cal use 


evaluation, A 
ord should be said concerning the 
umework within which this evalua- 
the WISC is conducted. In 
a number of articles re- 
with the 
10 years have also 


tion ol 
iddition to 
porting research on or 
WISC, the last 
shown advances in the methodology 
and 

(See espec ially 
1951; Raiffa, & 
Chrall, 1954; American Psychological 
1954; Cronbach & 
Che psychological test 


of psychological measurement 


theory construction 
Coombs, Coombs, 
Association, 
Meehl, 1955 
has come to be seen as only one ele- 
ment in the total process of theory 
The full value of the 
as a measure of a psychological 
variable depends upon how well the 
which it is 


construction 
test 


entire system in 


used 
stands up to both logical and experi 
mental test 

[his view of the over-all ‘‘validity 
of a test demands that (a) the area of 
the object world to be covered, (+ 
the nomothetic network containing 
the variable, and (c) the steps by 
which the test is demonstrated to be 

measure of the variable, be made 
that all assertions be 
subjected to empirical test. 

Chis article is a review of the lit- 
erature concerning the WISC since 
its publication in 1949. 
ire twofold: (a) to 


and 


public, 


Its purposes 
evaluate the 
WISC as a measure of various psy- 
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chological variables, and (6) to bring 
together for the user of the WISC in- 
formation provided by the p 


ade of research 


ist dec- 


Tue WISC as A MEASURE OF 
INTELLIGENCI 

Content Validity 

In practice, a great deal of weight 
is often given to the user's assessment 
of the content validity of the WISC 
The actual assessment is not simple, 
plic ated by the 
both in form ar 
between the WISC and the 
scales. As noted by 
1953b; Shaffer, 1949 
attributed to it the validity of these 
other scales 

The content validity of a 
volves a discussion of one's 


however, and is com 
similarity id content 
adult 


Delp, 
.it tends to have 


' 
others 


test in- 
intuitive 
reasons for suspecting that a test will 
measure a 


demonstrated “ by 


variable, and is 
showing that 
the test items are a sam uni- 
verse in which the investigator is in- 
terested’’ (Cronbach & Meehl, 1955, 
p. 282). In 
constructing 
validity by incl 

his opinion, will elicit 
ilar to the 
hopes to predict. S 
if the test items are j 
ilar to the stimuli 
elicit the 

would then expect the test to der 


certain 


che yl gist 


essence, the ps' 
a test involves content 
iding items which, 
beh iv 


beh ivior he e' 


pred ted 


I his pre- 
must be shown 


strate predictive validity 
dictive validity , how- 
ever. To achieve any final evaluation 
of a test in terms of content validity 
alone is to rely upon intuition as a 
criterion, 


fine record as a guide for exploration, 


and, while intuition has a 


its use as a scientific criterion has 


consistently led to misinformation 
To assess the content validity of 
the WISC, 


be defined 
Wechsler’s concept of children’s in- 


a universe of items must 


which is relevant to 
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telligence. Unfortunately, beyond a 
few general remarks (Wechsler: 1949, 
1950; Wechsler & Weider, 1953), no 
theoretical discussion of the « oncept 
of intelligence as it applies to children 
To proceed, the as 


exists in print 


sumption must be made that, at least 


its more general aspects, the di 
sion of adult intelligence (Wechs 


1944, 1958 


is applicable to chil 


nition of intelligence 
As far as the trait 
ec” is 


concerned 


which is judged to tap 
ingregate, or global cay wit 
uurposefully, to think ratio 
nd to deal effectively with his 
nent’’ (Wechsler, 1944, p. 3 
included as a potent: 
item Defined only at this 
s difficult to conceive 


ure of directed behavior 


might be if test 
rather 
gross level, it 
be definitely excluded 

issumption is made that 
nse to any intellective 
cted not only bv his get 
ce, but by other ‘‘nor 
factors such as “‘drive'’ 
While 


I ition.”’ 
sler presents rather convincing 


ve coordit 


luding such factors 


of this ur 


nts tor in 
iverse is | 
Wechsler 

» controlled for the dif 
, 


il etlects ol these 


exalt ples 


*“nonintelles 
including a wide v 


(Wechsler, 


tors by 

riety ol types olf 
1944 

As the te 


irate 


items 


st is construc ted, two sep 


appear to be i 


questio! S 


volved a) the sampling of the ur 


verse of relevant ‘“‘nonintellective 


factors bi the different subtests or 


combi itions of subtests and h 


the sampling of items within each 


subtest By 


theory, 


looking at explir it}s 


stated there seems to be no 
way in which the adequacy of the 


sampling of ‘‘nonintellective”’ factors 
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can be ascertained, for no statements 
are made to limit the possible range 
of tactors 

On a less formal level, however, 
there seem to be several factors ofte: 
included in any 


1 


ol intelligence, but not ly 


adequate 
represented in the test While these 
“omissions would be of 
WISC wer 
strated to have the 
they 
hypotheses if such v 
lacking in at 

1. Which test 
call for the ! 
learned material 


for the 


quence if the 
cle sire 


validity , might prov 


memor 
terial? Further 
lation rul 


which takes 


o-one re 


lation 

or which 

very short periods of t 
The degree te 

cluded withu 

in idequate Sal 

lar child is a prot 

ntelligence test 

large 


WIS( It is 


that 


other source 
oby 
the degree 
would have 
answer to 


Juliet 


rubies?’ 


the backgroun 
child might have 
/ 


matter what 
beer 
criticism could also 
‘ illing tor sper 
ting puzzles to 


ing a 


pene i] 


WIS 


lack any explicitly stated, organized 


In summary, the 


network of intuitive reasons for ex 


pecting it to show predictive validity 


other than the very broad assump- 


[INTELLIGENCE 
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tion of a general factor which enters 
into the purposeful solution of all 
problems—whether they occur in a test 
or in the child's life. While Wechsler 
speaks convincingly of other, non- 
intellective factors which enter sig- 
nificantly into a child’s actual behav- 
or in problem situations, there ap- 
rs to be little evidence that these 
factors are sampled in any system- 
This the 
to depend very heavily 


forces 


of the WIS(¢ 

whatever 
oriet ed and 
WIS¢ 


user 


demonstrated criterion 
construct validity the 


might have. 


If the use of the term predictive 
ilidity is restricted to correlations 
the WISC and some nontest 

ot predicted behavior ob- 

at some time subsequent to 

the WISC, 


are no relevant studies in the 


idministration of 


ure reviewed. This is very sur- 

is it is difficult to conceive 

iny situation in which the WISC 
ght be used that would not involve 
of behavior. As it 
this lack of explicit evidence 
of the value of the WISC in the pre- 
diction of subsequent behavior must 
be viewed 


pred tion 


| 
indas, 


as a major weakness of the 


test 
Concurrent Validity 
In general, reports of the concur- 
validity of the WISC have been 
restricted to correlations between the 


WIS( other test 


achievement or 


measures of 
intelligence. Most 
of the studies relating the WISC to 
other have 
oriented toward assessing the com- 
parability of the various IQ scores. 

Stanford-Binet. 


ind 


ntelligence tests been 


Studies reporting 
of WISC 
with S-B scores on different popula- 
tions of children began to appear soon 


after the publication of the WISC. 


the comparability scores 
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A summary of these correlations ap- 
pears in Table 1 

Frandsen and Higginson (1951) 
reported a study on 54 fourth-grade 
children and concluded that “IQ 
norms from the S-B and WISC are 
comparable at least within the range 
of one to two sigmas above and below 
the mean” (p. 283 
favorable and unqualified statement 
of the comparability of the WISC 
and S-B appearing in the 
An article by Pastovic a: 


Chis is the most 


literature 
d Guthrie 
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(1951) followed summarizing the re- 
sults of five unpublished master’s 
theses. They concluded “that the 
WISC 10 should not be interpreted 
as equivalent to a Binet IQ at age 
levels below 10 years of age since the 
WIS¢ is consistently 
than that of the Binet" (p. 385) 
Krugman, Justman, Wrightstone 
and Krugman (1951) found signifi 
cant differences between the WIS( 
Full Scale and Performance Scale 
[Os and the S-B IO at all age levels 


score lower 


TABLE 1 


STUDIES REPORTING CORRELATIONS BETWEEN THI 


Author 
Frandsen and Higg 
(1951) 
Krugman et al. (1951 
Nale (1951 
Sloan and Schneider (1951 
Stacey and Levin (1951 


Weider et al. (1951 


Pastovic and Guthr 


Clarke (1950)> 


Rapaport> 


Cohen and Collier 


M ussen et al. (1952 


Sandercock and Butler 
(1952)¢ 


Triggs and Cartee (1953 
Arnold and Wagner (1955 
Gehman and Maty us (1956 
“ame gr 


Childrer 


ferred to pay 


Stroud et al. (1957 Grades 
mean IQ d 
Schachter and Apgar (1958 S-B administers 


before WI1S¢ 


* Study conducted using Form M 


WIS 


n9o.4M SB 
100.2 Mo..-WIS« 


® V =Verbal Scale; P = Performance S ale; FS = Full Sx alr,. 
> Studies summarized by Pastovic and Guthrie (1951). 
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(5-15), which 
favor of the S-B 
the S-B and 
tended to be 
younger age levels 


further that ‘‘there is 


were consistenti in 
Differences between 
+ IS( Verbal Scale 
significant only at 
They 
a definite tend 


to 


com lude d 


ency for greater differences 
ith l 
ford-Binet IQ's,” and that differences 
between S-B and WISC Verbal and 
Full Scale 10s ‘‘tend to be ited 


with chronological age 


be associated w the higher Stan 


iSScx 


differences are larger at 


12? 


be noted that acl 


levels’’ (p 
It should 
not obtain an 10 above 15 
WISC without extrapolat 
the the S-B w 


lr} 


rms, while 
much higher scores 
explain in part the 
greater differences 
with the higher S-B 1Os 
Weider, Noller, i l 
(1951) also found that w! 
and WISC IQs ig 
related, ‘‘the Binet IO tend 
WIS¢ 10's for 
332 \ regressio! 
iggested relating S-B 
Scale 10s in which 
et pl is 11 Ay 
when S-B 
; the WIS( 1Os 
the S-B IQs 
1952 Mus 
berg (1952), and 
1 Lauber 


orrelatio 


were 


ire § 
higher than the the 


same childres 


equation was si 
WISC Full 
WIS«¢ 
cording to 
1Os 


equals 0 85 Bu 
this fe 

be low 7 
would be higher 

Cohen and 
sen, Dear 
Stroud, 
also re betwee! 
the S-B and WISC ! t lence 
that the WISC ter 


dren | 


' 
rinula, 
are 
than 
Collier 
ind Roser 


195 


Blommers at 7 
ported 
ore chil 
within norna Tie ul T nves 
lower than the S-B is presented by 
Kureth, Muhr and We 1952 
100 five ind six- 
ind by Lev nson 
ol 117 if wisl pre 
school children lriggs and ( 
(1953 
in the kindergarten of an independent 
school (S-B mean IQ of 124.11), and 


isyve rbe t 
their study of 
children 


in his study 


in 
year-old 

1959 
artee 
tested 46 rather select children 


} 
i 
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found WISC IQs to be consistently 
(Full Scale mean of 107.58). 
hey concluded further that ‘there 
is a marked tendency for larger dif- 
ferences between Stanford-Binet and 
WISC IQ's to be related to higher 
Stanford-Binet I1Q’s”’ (p. 29). 
Arnold Wagner (1955 
mined 50 children drawn at random 
elementary 
cluded that ‘‘so far as this sample is 
concerned, the relationship between 
IQ's obtained for eight- and nine- 
year-olds with the WISC (Full Scale) 
ind Form L Binet is not significantly 
different from the relationship be- 
tween IQ's obtained on Forms L and 
M of the Binet” (p. 93). The Verbal 
Scale related significantly better with 
the Binet than did the Performance 


scale 


lower 


and ex- 


from schools and con- 


Preschool S-B IQs were compared 
with the school-age WISC IQs of 113 
children selected at random from a 
clinic population born at a women’s 
Schachter & Apgar, 1958). 
Of the 494 children requested by mail 


‘ 


hospital 


o return for testing, 119 returned for 
both for 
other reasons. The resulting correla- 

see Table 1) between the 


tests; six were eliminated 
tion of .67 
S-B and WISC Full Scale 10s was 
reported to compare favorably with 
previously reported correlations be- 
tween pres hool and school-age S-B 
IOs 
[he comparability of 1Q scores of 
WISC and S-B when applied to 
entally defective children has been 
investigated by several authors. Nale 
1951) found the rather high correla- 
tion of .909 between the WISC Full 
Scale and the S-B, Form L, for 104 
defective children, while Stacey and 
Levin (1951) and Sloan and Schneider 
1951) report correlations of .68 and 
493 respectively. In general, the 
WISC Full Scale was found to score 
somewhat higher than the S-B for 
these defective children. 


tre 
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Sandercock and Butler (1952) com- 
pared the WISC and S-B, Form M, 
IQs of 90 mentally defective chil- 
dren and co that ‘“‘correla- 
tions obtained between the Stanford- 
Binet (M) and the three WISC IQs 
indicate a high degree of relationship 
between the Binet WISC Ver 
bal”’ (p. 104). 

Several of the conclusions and as 


luded 


and 


Luthors 


Hol- 


sumptions made by various 
were subjec ted to direct test by 
land 
There was no significant practi 
fect on the WISC IQs wher 
was given first and the mediai 
val between 

(b) There was a Sigi cal 


(1953 who found i! par 


the tests was sever 

t difference 
between the correlations of the S-B 
with the Performance and with the 
Verbal and Full S« the WIS¢ 
(in favor of the 
Scales . ( 
difference betwee 
the S-B with the Verb 
Scales of the WIS« ad 


no significant rel 


iles of 
Verb il 
I here was ! 


n the cor 


itionsh Dp betwee 


ge and the 
S-B and WIS¢ 
There was no significant 1 
between S-B IQ and the differe: 
between S-B and WISC IOs 
In general, the follow 


chronological 
between 


liffereiice 
[Os é 


1 
. ‘ nehir 
PCidtit ba | 


sions can be drawn fron 
about the comparability 
and S-B IQs 


1. Studies 


Delattre and 


M. LITTELL 


and IQ ranges are very con- 


sistent in showing that at least within 


ages 


a white American school population 
the WISC and Stanford-Binet scores 
are related to a significant degree 
Correlations between the WISC I ull 
Scale 
reported within the .80’s 

2. The WIS¢ tend to be 
lower than S-B scores for 
childret in the middle and upper 


ranges al 


and the S-B are predominantly 


scores 


the same 


d somewhat higher for de 

fectives This ippears to be particu 

larly true for younger children (below 
the hivher S B scores 

g the S-B as 


' 
correlations are 


a criterion, the 
found with 
the 


and lowest with 


highest 
the Full Scale IO scores, 
ighest with Verbal, 


Perf 
: 
’ 


next 


echsler-Bellevue 
the We: hsler W-B 
WISC overlap for the 
through 15 has Ik 
nvestigating the 


the WISC and W 


that 
and the 
years 10 


The fact 


Bellevue 


1 to several studies 
comparability of 
B scores | he cor 


ire summarized t 


ind Milstein 

the many sim 
WISC and the 
comparability 

not actually there, admis 
he W-B and WISC to 30 Jun 
School bovs They found 
e the WISC and W-B scores 
ited, the Verba! and 


are highly correl 
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Full Scale scores Oo the W IS( 

significantly higher (at the .01 level 

Che Performance Scales, on the other 

hand, were not significantly different 
Price and Thorne (195 

two groups ol! whit \ 

school childret 

the 114 

WIS¢ 


IO meat 


that th 


and Wechsler 
ive ranye over 


The two s« 


WIS¢ 


VCE ( 
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Scale correlations are reported in the 
70's and .&0's 
2. The W-B Verbal Scale 
tend to be significantly lower than 
the WISC Verbal Scale scores for the 
me child. It may well be that the 
WISC items are 


this age level 


scores 


more appropriate at 
individual intelligence tests 
In the following studies the WIS( 
has often been used as the criterion 

st which the other test is 


Uther 


vali 
The results of these studies 
Table 3 

Cohen & 
Pastovic & Guthrie, 1951; 

& Schneider, 1951) are reported 
which the WISC has been corre- 

1 with the Arthur (see Table 3 
I he Arthur, as might be exper ted, 
ppears to correlate better with the 
WISC Performance Scale 


the Verbal Sx ile 


Collier, 


than with 


length of time 

‘ister the WISC, 

nd Wiechers (1954) investi- 
possibility that the Colored 
ve Matrices could be used as 
re of intelligence with greater 
than the WISC and a similar 
f validitv. One hundred nine- 
Idren from four Indiana 

Is were given the Matrices and 
WISC in counterbalanced order 
thors concluded that 


high correlations (see 


“In view 
Table 
the ease and speed of admin- 
t would seem that the Col- 
Matrices will find 
xtensive use in the clinical 
of children” (p 144 
lowing the positive results ob- 
b Martin and Wiechers, 
ind Carleton (1955) investi- 
the degree to which the WISC 
S-B scores of Ss for 


OWVTEeESSIVE 


a restricted 
of intelligence (possible menta- 
tives) compared with performl 


on the Colored Progressive 
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Matrices. They found much lower the Columbia Mental Maturity Scale. 
correlations Group intelligence tests. Correla- 
Motivated also by the time factor tions between the WISC and the Sci- 
Jarratt (1956) investigated the rela ence Research Associates Primary 
tionship between the WISC and the Mental Abilities Test are reported 
1938 edition of the Progressive Ma yy Stemple (1953) and are shown in 
| 4 
up the entire fourth grad schoo Altus (1952) reported correlations 
Barratt found corre] ons ol between the WISC and the California 
699. and .754 lest of Mental Maturity “TMM 
Because of the small number She selected a sample of 55 Junior 


trices. Using 70 children wh: 


({ 

5 

studies reported, | dithcult to draw High School children so as to repre- 

nore than very te! tive co sent the entire student body as to 

ibout the relatior twee! sex, proportion in each grade, 

and either forn he Prog ive proportion of bilinguals and IQ as 

Matrices. It does appear, however measured by the CTMM Che cor- 

when the Colored fort ip relation of .81 between the WISC 

a group of child: with a ill Seale and the CTMM Total led 

normal sprea ra yr fairly her to conclude that ‘the WISC 

high correlatio ul x pe probably has considerable validity in 

ind that the Verbal and riormance f iparable school settings’ (p 231). 
Scales correl jually wel : cond study by Altus (1955 

Investigating s] fically the prol which was undertaken to test the as- 

of testing child: re ption that the verbal and non- 

1 Fill 5 verbal portions of the WISC and the 

reported study rrelat CTMM are significantly related, re- 

WISC with the Ammons Full k ported further correlations between 

| these two tests (see Table 4 Che 

100 students referred to the guidance 

ent by teachers included 36 


who were referred for special training 
| or the mentally retarded 


lt justified to conclude that 

1 comparable school referral 

the WISC and CTMM are 

comparable as to group 

it and roughly comparable 

to individual abilities.” 

but its parti lar ty] ol iestio While the three studies reviewed 

\s part of a study to determine ill report rather high correlations be- 

whether currentl: vailal s tween the WISC and group intelli- 

would predict school ach entfor gence tests, again the small number 

bilingual pupils on the rritory of of studies precludes more than the 

Guam, the WISC was administered ry tentative acceptance of these 
to a sample of 51 fifth gra f conclusions 

Cooper, 1958 » of th lchievement tests. Mussen et al. 

guage handicay ignificas orrela 1952) reported a study with a group 

tions were reported with the r of Ohio State University elementary 

International Performa: cale and school children correlating WISC 
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scores with ric mea peiling tests of the Iowa Tests of 
ichievement I rrelatiol c Skills battery” (p. 18). The 
vary from .29 . ict th tests were administered to 725 pupils 
the intellectual wa ited by rades three to six drawn from a 21 
the “highly select population” ma yunty area in lowa. All of the chil- 
well have affected the ol d « dr had been. referred for psycho- 
relations adve logical interviews and testing and 
b randse: id rinso : were in, or were thought to be 
found rather ) | I idle ra ie kind of school difficulty”’ 
correlation ide childs 18 The mean IQs were within 
normal range. All of the 

intercorrelations were calcu- 

| beta weights for the vari- 

ts determined The au- 

that the Arithmetic, Vo- 

Block Design, and Object 

i 


subtests were most efte¢ 


liction for both the orig 
1 a cross validation 
9 like pupils They con- 


their study gave no sup- 


use ol separate verbal, 


subtest scores in dif- 


the WISC IQ to 
m of achievement was in- 
by Robinowitz 1956 
| to discover whether the 

i | as me isured by 

scale is the one who lea 

ionship of opposition 
Robinowitz found a sig- 
fference (at the .01 level 
the WISC between those 
10 were able to learn the 
ind those who were not 
serial correlation of .609 
between the ability to 
lation of opposition and 

WIS( 

directly related to 
Mussen et al. (1952 
relations between teach- 
Stroud i . | | t f rating ol intelligence on the Hag- 
determine 1 ff witl f lson-Wichman Rating Scale 
whi h all or il ] ( bi { | ol telligence and the \ ISC of 64, 
the WISC sul ild b ised 53 nd .68 for the Verbal, Per- 


predict pertor: adil mance, and Full Scales respec- 


Comprehension, 
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In general it would seem that the sumption of a general trait which 
relationship between ability and underlies all test behavior rhe 
achievement must be recognized as broader assun ption of a general trait 
highly involved and complex, and entering into a purposeful behavior 
should be subjected to much further both in an it of test situations is 


investigation \t present it seems not tou y these studies 
safe to say only that the WISC re- Vonintellective factor Also basic 


lates to scores on certain types of aca- to Wechsler’s theoretical position 1s 


demic achievement t r certain ption that the particular 
groups of children quite I] n t ised in the WISC tap not 
general, the Verbal he | | a niy ( | ! lh ence, but other 
relate to test-measured academi “noni lective’ factor Some of 
achievement better than 1e T- ese factor pecif » the par 
formance Scale ular it t ‘ cific skills 
more 
Construct Va ki ; ‘ ind a ill of 
While an attempt at a full appraisal ibte “drive” While 
of the construct validity of the WIS( 
would go far beyond the scope of thi ral theory in j nting for 
article, a few | 
order. 
Concerning 
Cronbach and 
that ‘‘unless 
thetic) makes cor wit] ser i ts use nection 
tions, and exhibits expli pul tween tl rbal | Performan 
steps ol intere! I ru ilida i Vechsier 195 tentativels 
tion can { laimed”’ (1 1 t 3 tl i r is measured by 
present, since little inde] he adult is a verbal compre 
tionale exists for it would fa r and onverbal or 
seem that only a fev her g ral ictor (variously identified 
hypotheses could be iv fror h " perfor nonverbal, spac 
conceptual framework behind the ind \ or ganization 
WISC. In few studies i reana yault 54) reported a factor 
tempt to make 
ence explicit an 
General inte 
tion of the global : 
telligence is bas 
of the Wechsler \ 
1944, 1949, 1958: Wechsler & 
1953), and would i: pl 
WISC should correlate 
measures of general intelli 
studies discussed under | 
of Concurrent Validity lend suy » spatial- ptu ictor 
to this view of general intelligen respond hly with the Verbal ar 
should be noted however! th it these i: riorn 


studies lend support only to the as- Lotsof mrey, Bogartz, and 
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ill subtests but Coding, Digit Span, 
and Mazes, at the 74-, 104-, and 134- 
year levels. These figures range from 
86 to .96. The coefficients of internal 
nsistency for the various subtests 
59 for Comprehension 

Completion at the 7}- 


r level to .91 for Vocabulary at the 
year level. The standard errors 
easurement in IQ points for the 
levels for the Verbal Scale, 

c Sx ile, and Full Sx ale 

3.00 to 5.61 

Wechsler (1949) and Sea- 
1. (1950) warn the user to 
count the fairly low rel 


yme of the subtests in in- 


la- 


either the absolute sub- 
or relations between them 

at the 74 year level 
yulary, Picture Arrange- 
lock Design, and Mazes have 
ff internal consistency 

le Comprehension and 
pletion fall as low as .59 
ty of the test in general 


rease with age, so that 


vel 134 all subtests except 


0) and Picture Com- 
68) are above .70 

lity of the WISC scores 
year period has been in- 
Gehman and Matyas 
children were tested in 

le and again in the ninth 
ficients of stability for the 

s were: Verbal Scale, .77; 
Scale, .74; and Full 


lo Uther Factors 


iring device, be it a sur- 

or an intelligence test, 

nfluenced by factors other 

Wechsler (1949) and S or t than the ones the user wishes to meas- 

il 1950 rey t « ent ( i! While WIS(¢ users appear to be 
ternal consistency halt vare of this fact, few studies appear 
bilities corrected by tl pearma which give direct information with 
Brown formula) for all scales and for vh to evaluate any particular 
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WISC examiner-child interaction. dence that the WISC cannot 
Practice effect Holl ‘ 54), » applied indiscriminately to all 

in an attempt to investigate -e rroups wit! 

fect of a particular kinder il n I n nterpretation ol the IO score 


1iout considerable revisio 


gram on the IQ scores of children, yuthern Negro children. In coi 

found that both his control ane n with another study Young 
mental groups showed I l I na tts 1951 te sted 40 southern 
gains at the O1 le vel) i ; ] ro cl | nwho were selec ted as 
Scale 1QOs over what Lippe rom ( tr roup 1 sentative ol 
his report to be approximately a tu hese children were 


month period The probl r not led namic ct 


gested by this study of t prac } ria I I judgment of observ 
WISC Full Seale 10 


Ss on 


of the WISC gir ver relatively score of | roup was, however 


short periods Ot t las not ne IOHOW Ul yn these resu 

writer's knowl ; n subje ‘ouns | Bright (1954 

to further direct ir ition larger group of so n Negrotr 
In studies which hil 

S-B or W-B have be: 

in close te por il pre 

thors have ¢ 

significant 

WISC scor 

Holland, 

to generalize fr hes 1 to B ren \ 

the WIS« ver t ; I im t tl pplical 

which the test it lentical VI to children of bilingual Mex 

rather tha Ir rl lar t | con pared the 

well be differ t hy ld tter ( these children ith u 


especiall rue of pertor t te! ! hildre: quated for 


us (1953 


ltu 
ilst 


vy of the 


in which a t nd performan ind foun 
might be retai or of verbal ite: t| rbal ’ f the biling: 


which migh tak or roup were | r than the Perfor: 


j 


int he school ro iad . , e Scales yanhi Vv significant de 
re 
with others nce of nearly 17 points 
Variabli test situat f nce W four 
ria i .O ‘ i is lou 
possible et cts I liver ! ior ti ! ] ] While thi 
examiner’ ‘ } f ] . ‘ j . . . rt. 
tion is anoth prob! t hicl hilds he majority whom 
has no lV nea 10 tt ) re | o! rf sider ition 
mentally 
it the 
examiner and tl] hild Kercis ur in interpreting the 
cumstances of ca tr his Js obtained from any marked! 
is surprising 
these V iriables ippeal tt I I ' Ol 1959 idministered sey 
erally assumed ral intelliges tests to 117 Jewis!l 
reschool ind found that the 
eo at bblicati f th, : 
Range of Application o nd all 1 WIS( 


Lnree scores were 


The literature provides conside igher at the .05 level of confidence 
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wcey (1955), who reported an 

analysis of the WISC for ‘‘a 

le of 366 subjects tested at Syra 

State School who can be classi 

defective, borderline and dull 

p 149). 7 hey found that 

se children (a) relatively few 

misplaced with respect to 

resentation, and suc h mis 

is does occur does not 

sulhcient extent to al 

the subtest total 

for each subtest there 

vely abrupt shift from items 

pear to be quite easy to ones 

juite difficult so that there 

ly few items of the middle 
sles 


by St icey and Portno. 


i +) 
stigated the assumption 
il defective children will 
nses to the WIS( Vocabu- 

i lower ¢ nceptual 
line childres Iwo 
were tested 24 
27 borderline 
responses were 
functional, and 
representing incr 
ncept formation. Con 
m the borderline 
cantly less func 
more descrip 


( lren The possibility of 
the WISC Performance Scale 
| was investigated 

Shapiro 1953 
children were 

ical he ilth, sex, 

ive and Goodenough 
IO Group (a) con- 
with a 60 db or 
iring in both ears 

to significant lan 


pment. The test had to 


somewhat to make pan- 


ions possible. Groups 
contained children with 
ial hearing “ach child was ad- 

1 } Each child 1 
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WISC 


and (b) with panto 


ministered the Performance 
Scale; Groups (a 
mime 
with usual instructions 
found that Groups (a) and (b 
not differ tly from 
other, but 

lower than Group ( 


instructions, and Group ( 
lhey 
did 


Cal h 


nificantly 


signifi in 
were both sig 

The authors concl 
the WISC Perforn 
be used without 
valid measure 
deaf children, it 
a correction tact 
fects of the pant 
In any case, they 
ance Scale ( 
pantomime as 

Very young child 
are reported concerning 
bility of the WISC to tl 
very young child 
noted, however, 
‘‘mental age’ of five six or below 
would in effect be 
as few as four or 
liability of such 
open to consi ler 
order to use the 
more items need 
lower end of 
cism would, of c« 
use of the WIS( 
dren below the 


years. 


Summary 


1 T here 
WIS( 


children of mark I lifferent 


norms no ipplicable to 


sub 


groups such as southern Negro and 


Mexic in Americat chil- 


bilingual 
dren. 


A Soc 1weconomiic status ippe irs to 
be a significant variable affect 


IQ scores of young childrer 


as opposed to fifth-grade children 
such that the children of higher so- 


M. LITTELL 


cioeconomic status tend to obtain 
higher scores 


3. The WISC 


tively insensitive to 


seems to be rela- 
differences 
among mentally retarded children 


4. The WISC 


when administered with pantomime 


Performance Scale 


instructions to either normal or deaf 


children can be used as a crude and 


low measure of intelli 


Spur©ric 
puri 


j 


the WISC is administered 


with ‘‘: ental ages” below 
years, the IO scores can be 
~ 


to be 


» the limited number of “fun 


relatively unreliable 
tional’’ test items at the low end of 


the scale 


Short Form 


of the WISC 

[wo articles report attempts to de 

velop short forms of the WIS( 
ind Stacey (1954) made up 
ifferent short forms of the WISC 
from the WISC records of 365 chil- 
dren had referred to the 
Syracuse State School for evaluation 
whom there was no suspicion 
IO range 46 
They correlated each of these 
with the Full Scale IQ, 
correlations which ranged .64 


oO subte st « 


who 


been 
and ior 
of organk 


to 91 


involvement 


short forms 
finding 
Com 


foratw ombination 


prehension and Vocabulary) to .&8 
subtest combination of 
Arithmetic, Block 


Picture Com- 


ior a hve 
Comprehensiot 
Desig: 
plet Oo! 

Less hopeful results were reported 
by Yalowitz and Armstrong (1955 
who der three short form com 
binations from the WISC records of 


229 children 


and 


ived 


referred for 
child guidance clinic. 
Correlations with the Full Scale 1Qs 
5S to .61 


numerous 


reasons to a 


The authors 
low correlations may 


ranged from 

felt that these 
be attributed either to the ‘‘wide sub- 
test scatter found in WISC records of 
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emotionally disturbed children, 
the lower subtest intercorrela 


tions found in the WISC than on 

Wechsler-Bellevue" (p. 277 
Armstrong (1955) divided the 

cabulary subtest of the WIS¢ 

two short forms consisting l 

even words Th over 

correlation for all 

months to 14 years 

88. She concluded th 

reliability involved in 

alternate word li 

Vocabulary list 

when compared 


p. 414 


The Problem of 


The departure 
oncept of mental 
to comments and t 
of finding an MA 
scores. Grove 1950) felt } 
the publication of the WIS«\ 
il contributio Wechsler 
thrown the baby out with the w 
when he discarded the cor 
MA along with its use as a ‘‘; 
method of defining levels of t 
formance.” The auth 
vided a method by 
e score could be 
We hsle I 
} 


opp sed to the 


ale 


absolute intellig 
the MA concep 

ng a child of 

ire of his ow 

ona given test 

must be interpreted ; 
“specific aptit i is 
lined three 

which scores corres} 
age” can be cak 


Kolstoe (1954) compared tl 


formance of 29 third- and fou 
grade children (S-B IQ 116 or abo 


with 29 eighth- and ninth-grad 
dren (S-B IQ 84 or below 
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he 12 subtests of the WISC. Differ- 
neces significant at the .05 level of 
ificance were found on only three 
the subtests. They concluded that 
‘ir results “support to a consider- 
extent the generality of the 


tal age con ept”’ (p. 167 


lue WISC as a DIaGNostTi 
INSTRUMENT 


with the growth of 
sychology, tests previously 
hin a circumscribed area of 
are finding use as more or 


ral diagnostic instruments. 


ere 


I 


R 
he WISC is, of course, a relatively 
tandard sample of a child’s behavior 
ch, can be used as any other 
le Completely ‘‘disorgan- 
" behavior, for instance, will have 
ilar diagnostic implica- 

ther it occurs on the WISC, 
h, or during a clinical 
yond this use, however, 
lency to attempt to pre- 

de variety of types ol behav- 


scores derived from the 


Subtest Deviations 


might expect, the almost 
possibilities presented by 
iables have engendered numer- 
pothe ses about how these vari- 
relate to various aspects of a 
s behavior. The problem of de- 
i “‘significant’’ deviation has 
nsidered by Alimena (1951 
orted a method for ac hieving 
ylity ot scores on the 
subtests (for all Wechsler 
ind for evaluating their dis- 
sion, based on the expected degree 
rait variation within the indi- 
|. The author reported that the 
yn norms have been calculated 
WISC and are available on re- 
rom him 


Om fil 


i 
ferences between Verbal and Per- 
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formance scores Recognizing th ientally defective children due 
many WISC users tend to attribute to brain damage or birth trauma and 
meaning to any differences between giving no evidence of severe motor 
a child's Verbal and ‘riormance defect showed no difference, and 1 
Scale scores, Seashore 5 ir the br il! la ged children showe d 
to Wechsler's original and iz significantly lower Performance 
tion data to investigate the 1 ning scores han the undifferentiated 
of such discrepancies grouy 
was originally designed so that the rt] in ( hneider (1951 
difference between é ri ; ind { \ nd 3 Nn 1951 also 
IQs and average Performa: ound tl rformance le to score 
was zero. Seashore found tha he mificantl igl al » Verbal 
sigma of the differe: 
ages was 12.5 
ancy scores 
normal distril 
There were 
ferences in « 

Investigat 
group differs 
of deviatior 
no appre 
rural and 
the feeble-: 
a Performa: 
their Verbal scor 
the nine pare 
fessional 
showed an‘ 
Verbal IQ 
IQ. Mean Ver 
points higher tha 
for both grou 

Newn in 
gated speci 
difference 
scores al 4 } ‘ ) 
mentally det ve child: Phe nterpret I WIS 

, 


found that (a) mer ly defective they compared the 


children classed a imilia d f 5 ri , protocols with 
obtained significantly higher sco the patter: ibtained from WISCs 
on the Perfor 
Verbal tests 


8 07 


ihe dat: 


classed as undiffer occurring o1 
also performed S$! 


i® 


the Performance thar h rl n other I mncluded that 


tests, but to a lesser de files is not large 
familial (mean differer wa ‘noug ) wart rrediction in indi 
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vidual cas and, whil Js W raham (1952) wished to check 
tend to be grossly similar, tl linical the assertion by others that the W-B 
sign approach cannot rried over 1 WISC profiles of unsuccessful 
from the W-B to th readers and psychopathic adolescents 
It should be noted tabir similar. He went over the records 
Guertin (1951) in their rev f th { 96 unsuccessful readers (25% or 
W-B through 1! yncl h re below the mean of the Wide 
at { \ { ‘ we Achievement Test who had 

ither a Verbal or Perform- 

core of 90 or above, com- 

scatters with the 

but unpub- 

idolescent psycho- 

I reported no statistics 


} | 
that these patterns 


losely.”’ For the un- 
he found Arith- 
Information, Digit 
Vocabulary subtest 


he mean, and Object 
Completion, Pic- 
Blox k Desi 


1 Similarities sub- 


it 


mil, 


Only 


sS devi 
at the .01 


esults with 


Arith- 


Beck 
nvestigated the 
104 children re 
candidates for a 
the educable men- 
These children were 
groups a) Organ 

by neurological 
ected organics 
psychological 


organic (N=29 
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for whom there was no evidence of 
organicity 
uation developmental history. 
Eleven more children were added to 
Group (c) a year later. From 
parison of the mean Verbal, Perform 
ance, and Full Scale scores and of the 
intersubtest s« he concluded 
that (a organi s tend to score lower 
on the WISC Full Scale tha: 
organics, (6) organics tend to 
lower on the WISC Performance and 
Full Scales th in on the \ erb il Sx ale, 
(c) the possibility of organic damage 
10 


from psychological eval- 


or | 


acom- 


itter, 


non- 


score 


considerably as the 
drops below the 70-80 range 
the WISC does not show 
teristic 
organics as a 


increases 


pattern of subtest 
group iS 
nonorganic, 


possibl; 


tarded childret 


The Interpretati 
test Score 


As noted 
1958) assume 
tap not only yen 


specific factors 


nature these 
far from clear. Some hi: 
by Wechsler (1944, 1 


he considers these fact 


ot 
OS 


the adult scales 

interpreting the 

WIS‘ 

children, however, bs 

ment that the subtests seen 

ure different factors in children 
Wechsler, 1949 


found evidence te 


tests of the 


in adults 
(1941 
even within the ad 
tests do not measu 


») Suge 


at all age levels 

Nowhere in the literature « 
is there more than the barest begi: 
ning of the investigation of the vari- 
ous interpretive hypotheses It 
would appear that most, if not all 


overed 


M. LITTELI 


are based on an intuitive appr usal of 
the cor 
formal 


istr itors 


tent of the subtest and the in- 
observations of test 
Whilk some agreement 
ve found as to the most likely 
nterpretation of some subtest scores 
Digit Span 
Similarities 


admin 
might 
other subtests 
might produce 
lisagreement Even if one 
as to what a 
the 
validatior 
be n 
ol the co- 
sistenc\ 
ind 
In the last analy 


it any prediction 


nent 


m should measure, 
empirical 


It should 


nost, if not all, 


j 


oted 


inter! il con 


ich doubt on any 


individual 
than 


\ plausible 


in 


more 


SUMMARY 
las reviewed the lit 
ing the WISC since 
1949 Its purposes 
to the 
Various psy 


‘valuate 


to bri y 
WIS¢ 

ight out by the 
h. The WISC has 
' " 


erms of its validity 


past 


*, 1ts ¢ h ir 


ive been provide 1 


iate within the 
will 
all of the 


information 


no atte mpt 


general state- 


‘m in order 


rather important 





Aside, perhaps, from « 
between the WISC and 
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orrelations retarded on the assumption that 
Stantord they will respond to various learning 


sinet for normal white school chil situations in characteristic ways. 


dren, further investigation 


the problems discussed cot 
nificantly to our fun I 
both practical ind theore 
cerning the WISC and its 


in particular ! 


The WIS 


of any of _ How well does the WISC predict this 

ild add si response in a well-controlled, experi- 
knowled mental situation? 

tical, « 3. Much more systematic atten- 

SA hr n should be given to investigations 

the many practical problems in- 

the use of the WISC asa 

tt have a measuring device. There appears to 


idequate rational Mucl mor be strong reason to suspect that 


thought and effort need 
voted to putting the WIS¢ 
theoretical foundat 
both the assessment of 
tent validity 

dicapped by 


rationale 
The lack of 
1 


‘ ' . tive 
~ S prec ( ive 


ommMmon Uses 15 

ent, the test's ce 

validities are 
pport the 


this cr 


ALTus, GRACE | \ 
the Wechsler Intelligence 
; pP n 195 16 


Grace T. W 


(RACH 
verbal and 1 
a 1 WIS 
143-144 
Attus, Grace T. A WIS 
readers. J. consull. Ps 
15¢ 


ti WISC scores are affected systemati- 
on a fir ca by many variables other than 
but little information 
t nature of these vari- 
relationships involved 
Especially i! need of 
nvestigation is the effect 
WISC scores of (a) variables in 
relationship between examiner 
nee, (6) the circumstances 
imination, and (¢« repeated 

" itions of the WISC. 
the other hand, the WISC ap- 
a relatively well-stand- 
with many virtues. It 
nsistently well with other 
easures of intelligence, appears to 
ly accepted and used, and, in 
seems to merit further re- 

ind development. 
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ATION VS. FACTOR 


COMMENTS ON “INTRACLASS CORREL 
TERMINING GROUPS 


ANALYTIC TECHNIQUES FOR DE’ 
OF PROFILES” 


HAROLD P. BECHTOLDT 


I 
I 


E 
E 


j 


Phe purpose of tl is t 1954 The multiple group proce- 
analyze briefl ‘ itements | dur ply a computing technique 
sented in a paper by rd ha which may be used to define one, 
man, Isaacs, ch but usually simultaneously two, or 
titled “‘Intracl rrelat vs. f ) mposite variables or ‘‘factors”’ 

of observations Appro- 

ipplication of the multiple 

procedures will lead to any 

possible orthogonal or oblique 

‘“‘factors.”’ The necessary sets 
for groupin ro 1 probl of weights for defining the desired 
typology | reading factors’’ may be obtaining in a 
paper reveals f mil ( ber ways; one may use any of 
or incorrect pres t i tec! t il “complete centroid” or 
and procedural ter f tl rou] ntroid’’ methods with or 
technical matter f ot clarif without “rotational” transforma- 
other investigato nterested p o1 ne might apply cluster 
plying these two approa pt ls based on an inspection of 
lems of profile comparisons iv b lata. An investigator may use 


Ser ously | 


priori “‘theoretical’”’ or even 
sion will not consider tl inda vithin limits) an arbitrary formula- 
mental question of tl ! 3 f t for the weights or supplemental 
y scientific purpo " ! thod nformation such as clinical ratings of 
of grouping individuals « ba ises as ‘“‘hypertensive,”’ “‘neurotic,” 
of the “shape, level,”’ t or ‘‘psychotic,”’ the procedure used in 
ter’ attributes of a set « mibx he paper under discussion. 
termed a rofil he “‘oblimax rotational solution” 
Gleser, 1953 mply one of the several available 
is nalytical procedures, each of which 
ANALYT ee fines the ‘‘factors’’ by an objective 
Readers of the paper b rad ransformation of certain types of 
et al. may well ul he incor lata called “‘factor loadings.”’ As de- 
impression that tl rthogona fined by Thurstone (1947), factor 


troid method, t ( tro. thod loadings are the “orthogonal projec- 


FACTOR 


with an oblimax tions” of the variates, considered as 
multiple group method t r vectors, upon a set of special axes 
at least two, d nt method added to the system and named 
factor analys irly miscor “normals” or ‘‘reference axes.’’ The 
ception of the multiy roup an oblimax procedure is a method for 
centroid techni is dis “rotating’’ such a set of reference 
ods has been cla in pap ixes, either orthogonal or oblique, to 
Guttman (1944, 1952) and rt positions defined by a mathematically 
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stated criterion. 

The common misconception that 
the oblimax procedure! is restricted 
to rotating orthogonal reference axes 
may account in part for the inappro- 
priate comparison by Haggard et al. 
(1959) of the oblimax data (Set A 
with the data from the 
group method (Set B) shown in their 
Table 2, page 51. However, their 
discussion of these two sets of data 
in terms of factor loadings and pro- 
jections as well as their untenable 
distinction between 
and “orthogonal 
lack of understanding of the basic 
concepts of factor 
veloped by Thurstone (1947 
Holzinger and Harman (1941 

The computations for the Set B 
data and Set ( have 
checked by the writer and have been 
found to be accurate to within +.02 
Both of these sets of data represent 
orthogonal projections on a set ol 
“primary axes’ defined by the cen- 
troids of three ‘‘clusters"’ or groups of 


multiple 


“oblique space” 
space” Sugwest a 
de- 
and by 


analysis as 


data been 


profiles; such projections are named 
the ‘factor structure” by Holzinger 
and Harman (1941), although Hag- 
gard et al. refer to the Set B values as 
the “factor pattern” and the 
term ‘factor structure’’ to refer to 
the intercorrelations among the fac- 
tors (p. 52). The Set B and Set C 
data are derived from two somewhat 
different definitions of ‘‘clusters’’ of 
profile vectors 

The Set A values are entitled the 
“centroid solution, oblimax rota- 
tion.”’ This designation of the Set A 
data is ambiguous. The Set A values 
from an oblimax program ight be 
expected to be either orthogonal pro- 


use 


> jections on a set of “norm uls.”’ or 
oblique projec tions on a set ol 


mary projec- 


' pri- 
axes,’’ these oblique 

1 The use of the oblimax with any 
set of reference axes was clarified by Kern 
Dickman. 


BECHT' 


tions being referred to by Holzinger 
and Harman as the “factor pattern." 
However, the values could also repre- 
sent two other sets of projections. 
apparently are com- 
A data in 
addition, an unambiguous identifica- 
tion ol not be 


made 


Because of what 

putational errors in the Set 

these values could 
vithout 

f 


if calculations 


repeating the entire 


Unfortunately, even when all the 
computations are correct, the values 
ted by the Set A data and by 


B data probably are not com- 


represe! 
the Set 
parable sets of values for an oblique 


system. The oblimax data either are 


linear regres 


coefhicients express- 


ing the variates in terms of the factors 


the val 


when ies are oblique proje 


tions on the primaries or, as “factor 


portional to regres 
The Set B data are 
the factors and 


loadings,”’ are pri 
sion coefficients 

covariances betweer 
. : 
the iables. Covariances 


profile Var 


and regression coefficients are not 


generally comparable 
the 
and covariances for 
fined factors ar 
are related by a simple equation 
(Holzinger & Harman, 1941, p. 327 

the 
the 


values in 


numerically 


However regression coefficients 
any one set of de 


id observed variates 


Contrary to ! plications of 
statements in paper under dis 
the the Table 3, 


ire not re les int to this prob 


cussion, 
page 51, 
lem 

In a 


tions betwee! 


factor analysis, the distin 
reference axes and pri 
mary axes on the one hand and be 


tween orthogonal and oblique pro 
other can be 
fairly 
ferences in numerical values between 
the three sets of data labeled Set B, 

| ind Set F in Table 1 


an illustration of these dis- 


jections on the very 


important The obvious dif 


below 


Each of these three sets of 


data or “‘factor matrices’’ and even a 
fourth m 


pattern 


itrix of ‘“‘reference vector 


values” might be referred to 
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TABLE 1 


THREE ALTERNATIVE METHODS FOR REPRI 


ENTING A GIVEN Factor ANALYSIS SOLUTION 


OBTAINED WITHOUT ROTATION 


Set B 


Orthogonal Pro 
on Cluster Primary 
Axes 
(Factor Structure 
| li 


49 
18 


OS 


10 

11 

1 ? 

Note Data compute 
tions betweer factors 
Haggard et al (1959 


* The decimal points | 
two places to the left 


by some investigator as showing the 
“factor loadings’ of 12 MMPI pro 
by the 
perte! sive, 
ychoti Phese 
data were computed fri the 12 
M MPI profiles as published by Hag 
gard et al. unalities wer 
provided by 


files on three axes defined 
clinical groupings of “‘hy 


“‘neurotic,”’ and ‘‘ps 


The comn 
| \. Haggard No 
made in obtaining 


‘rotations’ were 


these results; a single grouping of 


] ide as speci! 


the profiles was 


Haggard et al. (1959 
The Set B data are the 


projections on the 


orthogona 
three centro 
cluster 


vectors, a set ol primary 


axes,”’ while the Set E values are the 
oblique or Cartesian projections o1 
The Set I 


the orthogor il projections 


these same cluster vectors 


data are 
on the reference axes or 1 
’ le 


each of which is orthogonal to 


ormals. 
ill but 


Set F 


Orthogonal! Projections ih? 
Commun- 
ality 


on Normals or 
Reference Axes 
(Factor Loadings) 


Estimates 


—16 .801 
01 840 
16 078 


09 0s 008 

O8 10 990 
11 —21 644 
10 06 801 


21 71 745 
15 52 .007 
10) . 52 9R3 
01 66 .973 
04 2 58 .850 
1959) in Table 1, p. 49. The correla- 
Part Il, Set B, of Table 2, p. 51, of 


E and F. The decimal is located 


(The sets 
axes and reference axes 
are each collinear with the “inverse 
vectors” of the other set.) If another 
definition of the ‘‘factors’’ were used, 
as in an oblimax solution, three (or 
four) additional ‘‘factor matrices” 
might be computed. Comparisons 
between the results for different 
the factors could be 
made in terms of any of the corre- 
(comparable) factor ma- 


one of the cluster vectors. 
of primary 


definitions of 


sponding 
trices 
The ‘ 


lactor 


‘communalities” or ‘common 
of the 12 MMPI 
profiles are shown in the #? column of 
lable 1. Since three of these values 
exceed unity, the upper limit for 
‘“‘admissable’’ values of the com- 

the analysis is a ‘‘Hey- 
wood" case and not a “proper” factor 


1947). The 


varian es 


I alitice Ss, 


(Thurstone, 


| 
anaiysis 
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puted 
ble 1) ex- 
ind of 


vreater 


presence of five residual 

from the Set B data of Ta 
ceeding .04 in absolute value 
the three 
than unity indicate that another fac 
tor probably should be defined; de- 


com 


communalities of 


fined, however, only if a factor anal 


ysis is considered worth making with 
intercorrelations based on 9 of 


tions (here, 9 MMPI scales 


serVva- 


DirEcT CORRELATION METHO 


new 


A purportedly pI 
the problem of determining 


profile 
groups is presented by Haggard et al 
(1959) through the use of ‘‘empirical 
criterion profiles,’ i.e., the average 
profiles for a set of cases grouped in 
accordance with 
rules which 
information (p. 51 
proach is hardly nev 
1941 makes reference 
in 1931 (Burt: 1937, 
A more basi 


by these iu 


SOTTIE 


may 


poll 
discussion 
comparison, with reference 
Table 2, of Set B 
covariances between 
and 12 MMPI 
Set C data 
three average criteriosi profil S 
12 MMPI profiles ihey say 
the methods agree « that 
results are practically identical, 
that they are not prepared to 

that either set is 
the other Apparently the 
authors are not aware of the fact that 
the results are not identical solely be- 
cause of the defining procedures they 
he in 
of communalities rather than 
in the matrix of correlations for the 
Set B data in the first source of the 
small differences 
is their definition of average profiles 
in terms of the usual MMPI scaled 
stand- 


their 
cluster 


1 


profiles) with theirs 


the correlations between 
ind 
that 
the 


but 


losely ‘ 


argue 
ipproximated by 


(p 52 


happen to have used sertion 


unities 


] he sec ond source 


scores rather than in terms of 


ard scores (over the 9 scales): thi 
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second source of differences 


intro 
duces into their definitions differen 
tial weighting of the profiles in terms 
devia- 


of unequal profile standard 


tions The ‘‘criterion profiles’’ can 
not, they s iggest, be considered 
ymputed 

is p 54 
ird deviations are not equal 
the 


on equalized means 


since the profile 


“correla 
the 
tor 
as well 


valence ot 
/ 


5 methods 


] 


corre! 


ind of 
method 
ibove 


tions 


lard scores noted 


as the ease ot cor 


iputation of correla 
tions with ils or subtotals of 


scores 1 well known and has been 


de 
1941 
Ric h irdsor 


Because Or 


discussed in these terms in some 
tail by Holzinger and Harman 
Holzinger (1944 
1941 others 


this equivalence, it is 


and 
not correct to 


suggest that terion profiles are 
meaningtul” as a 


factor’’ 


more 
“‘defir ol | i than are 
itions of 


( orrelat ions 


differences are only. 
onal ones or are differences 
This same basi 


the 


ions point 
nvolved in ambiguous 
t dealing with the existence 


‘ 


neral factor’’ in the footnote 


4, page 55. Furthermore, if 
preciable amount of “reas 
of variables to new groups, 

mentioned by the authors 
were to arise, the flexibility 
le in the method using sums of 
weighted correlations or covariances 
for forming new groups might well 
overbalance the initial computational 
the 


convenience of “correlation of 


sums’’ method 

a second direct cor 
that of the intra- 
the authors 


phasize the well-known transfo;ma 


In presenting 
method, 


correlations, 


relation 
class em 
tion to standard scores imposed by 
the definition of a product moment 
As a result of this defini- 
tion the product moment correlation 


correlation. 
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is limited to providing a definition of a priori profiles’’ (p. 55). ‘‘Theoret- 
‘profile shape’’; one cannot n ical"”’ profiles have been written and 
bine with this measure of ‘‘s] ‘the the profiles of individuals compared 
independently defir ro i with “hypothetical types’’ or with 
butes of ‘‘level”’ and ‘ f ‘theoretical standard persons” for 
eral distinct composit f tl many years by Burt (1941) and 
profile attributes, as pointed ot y others. Every possible comparison 
the authors. can | d ned bD nif between any arbitrary sets of num- 
vestigator using th ntracla bers and one or more profiles can also 
efficient See! worth be made; in particular, observed 


however, that Cronbach hast ntly profile ‘“‘levels,”’ 


scatter,’ and 
raised a serious questior ) sh ipe”’ indices can be compared to 
lvisability of defini uch ybal riori values of these defined con- 
or composite concepts in | rea Correlations with such a 
ment of dyad ores shape’’ values could be in- 
Only twe tional ts relat ided in a factor analysis or ‘‘direct 
to the presentation of tl tracla " study as well as in an 
correlation will b le here he investigation using intraclass corre- 
first point is tha ly I : ! lation coethcients. Whether any of 
and variances are not iuthor these comparisons is empirically use- 
like “‘poor Clemer lo ful is another matter. 
gone torevet 
53 The data le and 

n ght even be u ed Tf hypot . ONCLUDING REMARKS 
regarding certain interacti is well few of the more fundamental 
is the homogene { { n onceptions or technically incor- 
when the stud operl ( statements contained in the 
duc ted maqu | Der by Haggard et al. (1959) have 
n noted. Other technically ques- 
ible discussions include the au- 
rs’ presentation of the orthogonal 
troid method (p. 50), the possible 
issignment of cases to groups defined 
factor analytic methods (p. 56), 
correct te ite that ‘‘witl pt nd the testing of statistical hypoth- 
dures di u if T ising sets of related observations 


possible to torn 


General theory and methods 
ctoring. Psychometrika, 1944, 


Macmillar i Multiple group methods for 
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REPLY TO PROFESSOR BECHTOLDT’S CRITIQUE 
ERNEST A. HAGGARD 


Department of Psychiatry, University of Illinois 


After rereading Bechtoldt’s cri Some of Bechtoldt’s remarks in his 
tique of our paper (Haggard, et al., sections entitled ‘Direct Correlation 
1959),' I still find it difficult to under Methods" and “Concluding Remarks”’ 
stand fully why he became so exer call for more specific comment, pri- 
cised over it. True, many of the com marily because I think he misses the 
ments in his section on ‘‘Factor An point from time to time. For exam- 
lytic Techniques” are interesting an ple, his apparent reference to the dis- 
informative—and nerally « cussion under the first italicized head- 
tive—but they are also essentially i ing on our p. 54, namely, “When pro- 
relevant to our paper and its purp file means and sigmas are equalized”’ 

It is obvious that Bechtoldt has to do with the simple fact that for 
factor analysis and is steeped in it product-moment r’s (over the nine 
It should also be obvious that I hay subscales or profiles for any pair of 
a somewhat negative loading or Ss), mean = 0 and sigma = 1 by defini- 
factor entitled “One should use fa tion. This is just the point which he 
analysis regardless of whether other emphasizes a couple of paragraphs 
one as wel later when he speaks of ‘“‘the well- 
or better."” There is clear difference of known transformation to standard 
opinion as to whether, when, scores imposed by the definition of a 
factor analytic techniqt hould | product moment correlation.”’ Also, 


1 
i 


methods can get the job d 


used in particular cases Fortu t may be that the expression ‘‘poor 
nately or unfortunately, I have never mentine’’ is unfortunate stylis- 
been enamoured of tl up of ically, but I still fail to see how the 
techniques, and o deal ent r—by itself—provides in- 
with problems of t isually formation as to the means and sigmas 
associated with methods, of the variates correlated. Of course 
would prefer an pro to data the means and sigmas are involved in 
analysis of the 1 by the computation of r but, even 
Creasy (1957 though they may be of great use, 
they are all too often ignored in 

ractice Along these same lines, 
sechtoldt should have gone much 
paper criticized | . than he did at the end of 
replying to his comn this paragraph: he should have em- 
*In a thought! ne ering phasized along with Tukey (1951), 
published several years a res tare = Lindquist (1953), myself (1958) and 


lined some of the we 


1 As senior author 


' 
4 ! 
for any “erroneous, dir P 


ce ved,”’ or “ar I gut 


factor analysis, such 

data needed, the time required to perform the unalysis involving psychological tests, which 
computations, and a number of tec! f far short of 10,000 man hours of testing 
difficulties. With reference oi ‘ p. 53 Creasy (1957) also mentioned 
Horst stated that “If ’ t t var | tations of factor analytic tech- 
with results of ar , ce 1 she s have so many other workers in this 
have 50 or 60 var $ of t least : }ut these matters, which are quite rele- 
500 cases wo { it unt to Bechtoldt's remarks, are too numer- 


take very seriou ] j ar ! t i oO plex to discuss here 
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many others that the estimation of 
components of much 


more useful approach to answering 


variance is a 
many research questions than r is or 
ever can be. 

In his next paragraph, Be 
has us on the when he 
the statement the procedi 
discussed thus far, it is not } 
form groups around such a priori pro- 
files."” We 
practicable” 
strong term” 
that, for the 
workers, these 
possible froma practical point of view 
Finally, in his concludit 
he appears to chide us 


ropes 
‘with 


vossible to 


should have said “not 


1 


instead of using the tooe 


not possible’ toindicate 


majority of research 


procedures ire not 
g paragraph, 
his 


phrase “other technically questiona- 


following 


ble discussions’’) for possibly suggest 


ing ‘‘the testing of statistical hyp 


pothe 
ses using sets of related observations.” 
Now, I thought we were clear on 
that point at least. We took pains to 
observe that, although multivariate 
data can be l 


analyzed properly or ly 


Creasy, Monica A 
an alternative to 
Statist 
19, 318-325 

HaGGarp, E. A 
analysis of variance 
1958. 

HaAGGArD, E. A., CHAPMAN, JEAN P 
K. S., & Dickman, K. Intraclass 
tion vs. factor analyti 
mining groups 
1959, 56, 48-57 


Soc., Serie 


Intracla 


New 


ISAACS, 
orre! i- 
tec hr € tor deter- 


of prohles. Psychol. Bull 


A 


HA( 


by the appropriate multivariate sta- 
tistical techniques, approximate pro- 
for pattern 


iv uilable 


dures studies 


of 
the information in the data without 


analytic 


are which utilize 


most 
violating 
tical issumptions 

In Corl 
Professor 


Trewor! k 


certain important statis- 


clusion, | wish to thank 
Bechtoldt for bothering 
our data and for 


his ibl 1—which 


to 
presenting 
illustrates so 
we made on 
that de 


the italicized values 


point 


nely “any 


would result in the same 


choice to which 


profiles belong 
together 


Accordi " 


form the three groups.” 


to his results, the Ss would 
“d exactly 
d I will leave it to other 


1] irly 


is they are in our 


those MW ho work 


ups of Ss and who do 
iccess to computer facilities 


be 


to decide which methods may 


more useful for their purposes 


tions of tactor 


Pro 


i esting 


il research 
a4 Conference on 
eton, N. J Edu 
1950 
De 
ych y and education 
Mifflin, 1953 
( omponents in 


1951, 7, 33-70 


tional 
S054 
n and analysis of ex 


Bos- 
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that a sound signal such as Yerkes 
used with frogs would also reduce 
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cps. The potentials then fell off rap- 
idly up to 3000 cps, beyond which 
point injurious intensities were re- 
quired to produce a measurable re- 
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according to Kuroda (1926). 
found that lizards would 
open their eyes to tonal stimulation. 
He likened this reflex to Preyer’s 
reflex 


ing 
Kuroda 


“as a reliable clue to make sure 
objectively of the normal state of 
audition.”’ Using a Galton whistle 
he established the upper frequency 
limit at 9675 vs per second (4837.5 
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Summary 


If one is willing to accept the be- 
havioral then 
adult salamanders, 
frogs and teads—can hear. Among 
the reptiles, alligators lizards 

snakes cannot, the 
hearing of turtles is in question. The 


available, 
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Because there has not been a 
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know nothing of the animals’ ability 
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is there 


present state of knowledge of hearing 
in amphibians and reptiles is not 


commensurate with the importance 
of these classes in the study of the 
evolution of the sense of hearing. 
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